MDR EFFLUX PUMPS

Background of the Invention 

Most bacterial cells express proteins that function as multidrug resistance
(MDR) efflux pumps, so named because they confer resistance to a large variety
of chemically unrelated agents by extruding them from the cell. MDR efflux
pumps have been described in phylogenetically diverse organisms including
bacteria (Nikaido, J. Bacteriol. 178:5853-5859, 1996), yeast (Kolaczkowski et
al., Microb. Drug Resist. 4:143-158, 1998), and mammals. Many MDR efflux pumps
are members of multigene families, the best described of which are the gene
families encoding the ATP Binding Cassette (ABC) membrane proteins, the Major
Facilitator (MF) superfamily, and the small multidrug resistance (Smr) family
(Michaelis and Berkower, Cold Spring Harb. Symp. Quant. Biol. 60:291-307, 1995;
Paulsen et al., Microbiol. Rev. 60:575-608, 1996). MDR efflux pumps have been
shown to contribute significantly to resistance to antibacterial agents in a
number of model organisms as well as important Gram-positive pathogens,
including Staphylococcus (S.) aureus (Yoshida et al., J. Bacteriol.
172:6942-6949, 1990; Hsieh et al., Proc. Natl. Acad. Sci. USA 95:6602-6606,
1998), Enterococcus (E.) faecalis (Lynch et al., Antimicrob. Agents Chemother.
41:869-871, 1997), and Streptococcus (S.) pneumoniae (Gill et al., Antimicrob.
Agents Chemother. 43:187-189, 1999).

The determination of the complete DNA sequence of microbial genomes has
progressed markedly over the last few years. Model organisms continue to lead
the way with the reported sequences of Saccharomyces (S.) cerevisiae (Goffeau
et al., Nature 387:1-105, 1997), Escherichia (E.) coli (Blattner et al.,
Science 277:1453-1474, 1997), Bacillus (B.) subtilis (Kunst et al., Nature
390:249-256, 1997) and Treponema (T.) pallidum (Fraser et al., Science
281:375-388, 1998). Integration of the genetic and biochemical information
obtained over decades of research with the newly obtained raw genomic sequence
presents an exciting challenge to biologists and bioinformaticists. Many open
reading frames (ORFs) have been identified and catalogued based on homology
with known genes, or known folding domains, using algorithms such as FASTA and
BLAST (reviewed in Storms, "Genome-wide strategies for studying gene function
by using model systems." In: Organization of the Prokaryotic Genome, pp.
347-365. Edited by R.L. Charlebois. ASM Press, Washington, D.C., 1999).
Significant efforts have been made to use homology relationships between ORFs
to establish hypothetical phylogenetic relationships between organisms as well
as the genealogy of multigene families within any given organism. Two multigene
families that have been examined in detail are those encoding the ABC and MF
families of proteins. For example, the complete genome of B. subtilis led to
the identification of no fewer than 59 possible ABC transporters (Quentin et
al., J. Mol. Biol. 287:467-484, 1999). Likewise, analysis of 5885 predicted
ORFs in the complete S. cerevisiae genome identified 186 potential MF proteins
(Nelissen et al., FEMS Microbiol. Rev. 21:113-134, 1997).

Summary of the Invention 

In a first aspect, the invention features a method of determining whether a
nucleotide sequence encodes an MDR efflux pump. The method includes the steps
of: (a) searching a database of nucleotide sequences for sequences having high
identity to a sequence encoding a known MDR efflux pump to generate a first set
of candidate sequences comprising sequences that have high identity to the
sequence encoding the known MDR efflux pump; (b) from the first set of
candidate sequences, selecting the sequences that include a sequence encoding
potential transmembrane domains to generate a second set of candidate
sequences; (c) in a bacterial cell, mutating the gene corresponding to one of
the candidate sequences; and (d) determining whether the bacterial cell
exhibits increased sensitivity to antibacterial agents, wherein the candidate
sequence encodes an MDR efflux pump if the bacterial cell exhibits increased
sensitivity to the antibacterial agents.

In one embodiment, the sequence encoding the known MDR efflux pump is a
nucleotide sequence, and the searching step (a) includes comparing the database
of nucleotide sequences with the nucleotide sequence encoding the known MDR
efflux pump.

In another embodiment, the sequence of the known MDR efflux pump is a
polypeptide sequence, and the searching step (a) includes comparing the
nucleotide sequences of the database, translated in all six reading frames, to
the polypeptide sequence of the known MDR efflux pump.
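
The six-frame translation used in this embodiment can be illustrated with a
minimal Python sketch. It is an illustration only, not the search tool used by
the inventors: the function names are hypothetical, the standard codon
assignments are assumed, and codons containing characters other than A, C, G,
or T are rendered as "X".

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons from position 0; '*' marks a stop codon."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translations(dna):
    """Return the three forward (+1..+3) and three reverse (-1..-3) translations."""
    frames = {}
    for strand, seq in (("+", dna.upper()), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


if __name__ == "__main__":
    for name, protein in six_frame_translations("ATGGCCTAA").items():
        print(name, protein)
```

Each of the six translated frames would then be compared with the polypeptide
sequence of the known pump by an alignment program such as BLAST.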

In a related aspect, the invention features a method of determining whether a
polypeptide functions as an MDR efflux pump. This method includes the steps of:
(a) searching a database of polypeptide sequences for sequences having high
identity to a polypeptide sequence that functions as an MDR efflux pump to
generate a first set of candidate sequences; (b) from the first set of
candidate sequences, selecting sequences that have potential transmembrane
domains to generate a second set of candidate sequences; (c) in a bacterial
cell, mutating one of the candidate sequences; and (d) determining whether the
bacterial cell exhibits increased sensitivity to antibacterial agents, wherein
the candidate polypeptide functions as an MDR efflux pump if the cell
exhibits increased sensitivity to an antibacterial agent.

In another aspect, the invention features a method for deleting a desired
region of DNA in a bacterial cell. The method includes the steps of: (a)
transforming bacterial cells with a vector that includes (i) a first region of
at least 30 nucleotides substantially identical to a first region of
chromosomal DNA in the bacterial cells; (ii) a second region of at least 30
nucleotides substantially identical to a second region of chromosomal DNA in
the bacterial cells; and (iii) a third region encoding a polypeptide that
provides resistance to a selection agent (e.g., kanamycin), wherein the first
and second regions of chromosomal DNA are on opposite sides of the target
region of DNA to be deleted; (b) selecting for bacterial cells in which the
vector of step (a) has integrated by a first crossover event into the
chromosomal DNA by adding the selection agent to the culture medium; and (c)
culturing the cells selected for in step (b) in the absence of the selection
agent to allow for a second crossover event to occur in at least a subset of
the cells, wherein the second crossover event results in the loss of (i) the
desired region of DNA and (ii) the region encoding the polypeptide that
provides resistance to the selection agent in that subset of cells.
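
The two crossover events can be pictured with a toy string model, sketched
below in Python. Everything in it is an illustrative assumption rather than
part of the disclosed method: the function names are hypothetical, the flanking
arms are far shorter than the 30 nucleotides the method calls for, and the
resistance marker is shown as the placeholder "<kanR>". The sketch also shows
why only a subset of cells carries the deletion: a second crossover between the
two copies of the same arm used for integration restores the original
chromosome instead.

```python
def integrate_by_first_crossover(chromosome, arm_a, arm_b, marker):
    """Insert a circular vector (arm_a + arm_b + marker) by one crossover in arm_a."""
    i = chromosome.index(arm_a) + len(arm_a)
    # The rest of the vector, plus a second copy of arm_a, now sits after arm_a.
    return chromosome[:i] + arm_b + marker + arm_a + chromosome[i:]


def resolve_by_second_crossover(integrated, arm):
    """Recombine the two copies of `arm`, excising everything between them."""
    first = integrated.index(arm)
    second = integrated.index(arm, first + len(arm))
    return integrated[:first] + integrated[second:]


if __name__ == "__main__":
    left, right = "ggcattac", "cgtatgcc"         # flanking chromosome (toy)
    arm_a, arm_b = "AAACCCGGGT", "TTTGGGCCCA"    # homology arms (toy length)
    target = "ttgacatgca"                         # region to be deleted
    marker = "<kanR>"                             # placeholder resistance gene

    wild_type = left + arm_a + target + arm_b + right
    integrated = integrate_by_first_crossover(wild_type, arm_a, arm_b, marker)

    deletion = resolve_by_second_crossover(integrated, arm_b)
    reversion = resolve_by_second_crossover(integrated, arm_a)

    print("integrated:", integrated)
    print("deletion:  ", deletion)
    print("reversion: ", reversion)
```

In both outcomes the marker is lost, so the cells become sensitive to the
selection agent again; the deletion and the reversion would have to be told
apart by a further check on the target region.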

The foregoing method can be repeated one or more times, if desired, to delete
additional regions of DNA in the same bacterial cell from which the first
region was deleted by repeating steps (a)-(c). Accordingly, the invention also
features a method for making a multigene mutant bacterial cell. The method can
be performed in any bacterial cell that can undergo homologous recombination,
including, for example, Staphylococcus aureus, Streptococcus pyogenes, Bacillus
anthracis, Clostridium tetani, Clostridium botulinum, Vibrio cholerae,
Helicobacter pylori, Salmonella typhimurium, Shigella dysenteriae, Bordetella
pertussis, Yersinia pestis, Pseudomonas aeruginosa, Neisseria gonorrhoeae,
Mycobacterium tuberculosis, Corynebacterium diphtheriae, Borrelia burgdorferi,
Treponema pallidum, Enterococcus faecalis, Enterococcus faecium, and
Streptococcus pneumoniae.

In yet another aspect, the invention features a method for determining whether
a test substance inhibits the growth or metabolism of cells of a strain of E.
faecalis bacteria having a disruptive mutation in a gene encoding a protein
selected from the group consisting of Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4),
Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID
NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18),
Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ
ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO:
32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20
(SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID
NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2
(SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60),
Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ
ID NO: 68). The method includes the steps of: (a) contacting the cells with a
test substance; and (b) determining whether the growth or metabolism of the
cells is inhibited. An inhibition of growth or metabolism, relative to the
growth or metabolism of cells of the strain of bacteria having the mutation but
not contacted with the test substance, identifies the test substance as one
having potential use as an antibacterial agent. The bacterial strain can have
disruptive mutations in genes encoding at least two of the proteins listed
above. For example, the bacterial strain can have disruptive mutations in genes
encoding three, four,
five, or six or more of the proteins listed above.

In another aspect, the invention features a substantially pure nucleic acid
molecule consisting essentially of E. faecalis Abc1 (SEQ ID NO: 1), Abc2 (SEQ
ID NO: 3), Abc3 (SEQ ID NO: 5), Abc4 (SEQ ID NO: 7), Abc5 (SEQ ID NO: 9), Abc6
(SEQ ID NO: 11), Abc7 (SEQ ID NO: 13), Abc8 (SEQ ID NO: 15), Abc9 (SEQ ID NO:
17), Abc10 (SEQ ID NO: 19), Abc11 (SEQ ID NO: 21), Abc12 (SEQ ID NO: 23), Abc13
(SEQ ID NO: 25), Abc14 (SEQ ID NO: 27), Abc15 (SEQ ID NO: 29), Abc16 (SEQ ID
NO: 31), Abc17 (SEQ ID NO: 33), Abc18 (SEQ ID NO: 35), Abc19 (SEQ ID NO: 37),
Abc20 (SEQ ID NO: 39), Abc21 (SEQ ID NO: 41), Abc22 (SEQ ID NO: 43), Abc23 (SEQ
ID NO: 45), Bmr (SEQ ID NO: 47), NorA (SEQ ID NO: 49), Mf1 (SEQ ID NO: 51), Mf2
(SEQ ID NO: 53), Mf3 (SEQ ID NO: 55), Mf4 (SEQ ID NO: 57), Mf5 (SEQ ID NO: 59),
Mf6 (SEQ ID NO: 61), Mf7 (SEQ ID NO: 63), Mate1 (SEQ ID NO: 65), or Smr1 (SEQ
ID NO: 67).

In another aspect, the invention features a substantially pure polypeptide that
includes a sequence selected from the group consisting of E. faecalis Abc1 (SEQ
ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5
(SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO:
16), Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12
(SEQ ID NO: 24), Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID
NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36),
Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ
ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50),
Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO:
58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ
ID NO: 66), and Smr1 (SEQ ID
NO: 68).

In still another aspect, the invention features a strain of E. faecalis
bacteria having a disruptive mutation in a gene encoding a protein selected
from the group consisting of Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3
(SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO:
12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10
(SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID
NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32),
Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ
ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO:
46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ
ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6
(SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID
NO: 68). The bacterial strain can have disruptive mutations in genes encoding
at least two of the proteins listed above. More preferably, the bacterial
strain can have disruptive mutations in genes encoding three, four, five, or
six or more of the proteins listed above.

The invention also features a method for determining whether a test substance
includes a compound that blocks efflux of an antibacterial agent from a cell.
The method includes the steps of: (a) providing a polypeptide selected from the
group consisting of E. faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3
(SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO:
12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10
(SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID
NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32),
Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ
ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO:
46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ
ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6
(SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID
NO: 68); (b) contacting the polypeptide with the test substance; and (c)
determining whether a compound in the test substance binds to the polypeptide,
wherein binding of the compound to the polypeptide identifies it as a compound
that blocks efflux of an antibacterial agent from a cell. The method may be
performed in a cell (e.g., an E. faecalis cell, or a heterologous cell) or,
alternatively, may be performed in a cell-free system.

In another aspect, the invention features a method for determining whether a
test substance decreases expression of an MDR efflux pump selected from the
group consisting of E. faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3
(SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO:
12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10
(SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID
NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32),
Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ
ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO:
46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ
ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6
(SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID
NO: 68). The method includes the steps of: (a) providing a cell expressing the
MDR efflux pump; (b) contacting the cell with the test substance; and (c)
measuring expression of the MDR efflux pump in the cell, wherein decreased
polypeptide expression, relative to a cell not contacted with the test
substance, indicates that the test substance decreases expression of an MDR
efflux pump. Expression of the MDR efflux pump can be determined by measuring
protein levels of the MDR efflux pump or by measuring levels of RNA encoding
the MDR efflux pump. The MDR efflux pump can be expressed in any cell (e.g., a
Bacillus (B.) subtilis or Lactococcus (L.) lactis cell). The foregoing method
may also be performed in a cell-free system that supports transcription (e.g.,
whole-cell lysates).

The invention also features a method for identifying an inhibitor of an MDR
efflux pump that includes the steps of: (a) expressing an MDR efflux pump
derived from a first bacterial strain in a second bacterial strain to provide
increased resistance to an antibacterial agent relative to the second bacterial
strain not expressing the MDR efflux pump; (b) contacting the second bacterial
strain expressing the MDR efflux pump with an amount of the antibacterial agent
that inhibits growth of a control strain (i.e., the second bacterial strain not
expressing the MDR efflux pump) but does not substantially inhibit growth of
the second bacterial strain expressing the MDR efflux pump; (c) contacting the
second bacterial strain expressing the MDR efflux pump with a test substance;
and (d) measuring growth of the second bacterial strain expressing the MDR
efflux pump, wherein decreased growth of the bacterial strain identifies the
test substance as one that includes an inhibitor of an MDR efflux pump. Step
(c) can be performed before step (b), after step (b), or simultaneously with
step (b).

The second bacterial strain can be, for example, a strain of B. subtilis or L.
lactis. The MDR efflux pump can be selected from the group consisting of E.
faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4
(SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO:
14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID NO: 20), Abc11
(SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID
NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34),
Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ
ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48),
NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ ID NO:
56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID
NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID NO: 68).

The invention also features a method for increasing the sensitivity of a
bacterial cell to an antibacterial agent, the method including the step of
contacting the cell with a compound that blocks efflux of the antibacterial
agent from the cell by binding to an MDR efflux pump selected from the group
consisting of E. faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ
ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 12),
Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID
NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID NO: 26),
Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ
ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO:
40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr
(SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO:
54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID
NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID NO: 68).
The cell can be an E. faecalis bacterial cell, but can
also be, for example, any bacterial cell described herein.

In a related aspect, the invention features a method for increasing the
sensitivity of a bacterial cell to an antibacterial agent, the method including
the step of contacting the cell with a compound that blocks efflux of the
antibacterial agent from the cell by decreasing the expression of a gene
encoding an MDR efflux pump selected from the group consisting of E. faecalis
Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO:
8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ
ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22),
Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ
ID NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO:
36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22
(SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO:
50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID
NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1
(SEQ ID NO: 66), and Smr1 (SEQ ID NO: 68). The cell can be an E. faecalis
bacterial cell, but can also be, for example, any bacterial cell described
herein.

In still another aspect, the invention features a method for determining
whether a test substance includes a compound that reduces efflux of
antibacterial agents from a cell. The method includes the steps of: (a)
providing a non-bacterial cell expressing a nucleic acid molecule encoding an
MDR efflux pump selected from E. faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO:
4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ
ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18),
Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ
ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO:
32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20
(SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID
NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2
(SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60),
Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ
ID NO: 68); (b) contacting the cell with the test substance; (c) contacting the
cell with a compound that is capable of (i) entering the cell and (ii) being
transported by the MDR efflux pump; and (d) measuring efflux of the compound
from the cell, wherein decreased efflux, relative to a cell not contacted with
the test substance, identifies the test substance as containing a compound that
reduces efflux of antibacterial agents from a cell. The cell can be, for
example, a eukaryotic cell, such as a mammalian cell, an insect cell, or a
yeast cell (e.g., a Saccharomyces cerevisiae cell).

The invention also features a method for determining whether a test substance
includes a compound that reduces efflux of antibacterial agents from a cell.
This method includes the steps of: (a) providing a cell-free system consisting
of: (i) a lipid membrane into which is inserted an MDR efflux pump selected
from Enterococcus faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ
ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 12),
Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID
NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID NO: 26),
Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ
ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO:
40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr
(SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO:
54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID
NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID NO: 68);
(ii) the test substance; and (iii) a compound that is capable of being
transported by the MDR efflux pump; and (b) measuring transport of the compound
across the lipid membrane, wherein decreased transport, relative to transport
in a cell-free system not comprising the test substance, identifies the test
substance as containing a compound that reduces efflux of antibacterial agents
from a cell. In one example, the lipid membrane is in the form of a liposome.

By "MDR efflux pump" is meant any cell transporter that has been functionally
shown to actively transport a drug or chemical out of a cell.

By "substantially identical" is meant that 30 or more consecutive nucleotides
exhibit at least 50%, preferably 85%, more preferably 90%, and most preferably
95% identity to a reference nucleotide sequence. For homologous recombination,
the length of substantial identity will generally be at least 30 nucleotides,
preferably at least 50 nucleotides, more preferably at least 100 nucleotides,
and most preferably 200 nucleotides.

By "high identity" is meant that two sequences share at least 10% of
nucleotides or 20% of amino acids, when optimally aligned, such as by the
program BLAST, over a comparison window of 90 nucleotides or 30 amino acids.
Preferably, the percent identity is at least 25%, 30%, 40%, or even 50% and
more preferably, the identity is at least 70% or even 80%. One sequence may
include additions or deletions (i.e., gaps) of 20% or less when compared to the
second sequence. Optimal alignment of sequences may be conducted, for example,
by the methods of Gish and States (Nature Genet. 3:266-272, 1993), Altschul et
al. (J. Mol. Biol. 215:403-410, 1990), Madden et al. (Meth. Enzymol.
266:131-141, 1996), Altschul et al. (Nucleic Acids Res. 25:3389-3402, 1997), or
Zhang et al. (Genome Res. 7:649-656, 1997).
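
The "high identity" test can be sketched in Python as follows. This is a
minimal illustration under stated assumptions: the two sequences are taken to
be already aligned by an external program (gaps written as "-"), identity and
gap content are both computed as fractions of the comparison window, and the
function names are hypothetical.

```python
def window_identity(aligned_a, aligned_b, start=0, window=30):
    """Return (identity fraction, gap fraction) over one comparison window."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = list(zip(aligned_a[start:start + window],
                       aligned_b[start:start + window]))
    if len(columns) < window:
        raise ValueError("comparison window extends past the alignment")
    # A column counts as identical only if both residues match and neither is a gap.
    matches = sum(1 for x, y in columns if x == y and x != "-")
    gaps = sum(1 for x, y in columns if x == "-" or y == "-")
    return matches / window, gaps / window


def has_high_identity(aligned_a, aligned_b, kind="protein", start=0):
    """Apply the minimum thresholds stated above for the given sequence type."""
    if kind == "protein":
        window, min_identity = 30, 0.20   # 30 amino acids, at least 20%
    elif kind == "nucleotide":
        window, min_identity = 90, 0.10   # 90 nucleotides, at least 10%
    else:
        raise ValueError("kind must be 'protein' or 'nucleotide'")
    identity, gap_fraction = window_identity(aligned_a, aligned_b, start, window)
    return identity >= min_identity and gap_fraction <= 0.20


if __name__ == "__main__":
    reference = "M" * 30
    candidate = "M" * 7 + "A" * 23   # 7 of 30 positions identical
    print(has_high_identity(reference, candidate))
```

The stricter preferred levels (25% up to 80%) could be checked in the same way
by raising the minimum identity.
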

By "disruptive mutation" is meant an alteration in the nucleic acid molecule
that results in decreased expression or function of the encoded protein.
Preferably, the decrease is at least 50%, more preferably it is at least 80%,
and most preferably the decrease is >99%. The mutation can be a point mutation
but preferably is an insertional mutation or a deletion mutation. Mutations in
the coding region and in the region regulating expression (e.g., the promoter)
are each likely to be disruptive.

By "test substance" is meant a compound (or collection of compounds) being
tested for its ability to inhibit growth or metabolism of a bacterial strain.
Specifically excluded are compounds that were previously known in the public
domain to inhibit the growth or metabolism of that species of bacteria. For
example, erythromycin would not be considered to be a test substance when
administered to a culture of E. faecalis because it was known that erythromycin
inhibited growth or metabolism of that bacterium.

Any test substance can be used in the present screening methods, including
naturally occurring substances and non-naturally occurring substances.
Exemplary test substances are low molecular weight substances produced by
living organisms or other substances, soluble in the growth medium, having a
molecular weight between 150 and 750 daltons. Test substances can be individual
compounds or libraries of compounds. One skilled in the art will recognize that
there are numerous sources for the test substance, and any of these is suitable
for use in the present screening method.

Determining whether the growth or metabolism of a cell is inhibited can be
performed using any of a number of standard assays, such as those described
herein. Cellular metabolism can be measured, for example, by assessing the
activity of one or more enzymes that function in cellular metabolic pathways.
Alternatively, cellular metabolism can be measured by any of the surrogate
markers that are known to cell biologists. Methods for determining growth or
metabolism include physical observation (e.g., MIC determination) and measuring
turbidity (e.g., absorbance at 650 nm; IC50 determination).

The method for determining growth or metabolism can be adapted for high
throughput screening, for example, by culturing the cells in multi-well or
microtiter plates and measuring growth or metabolism spectrophotometrically.

In the absence of any test substance, a bacterial strain having a mutation may
exhibit a different rate of growth or metabolism than that exhibited by the
parental strain not having the mutation. Accordingly, it is desirable to
determine whether a test substance inhibits the growth or metabolism of a cell
by comparing the growth or metabolism of a bacterial strain in the presence of
the test substance with the growth or metabolism of that strain in the absence
of the test substance.

A test substance is considered to inhibit the growth or metabolism of a
bacterium if it decreases growth or metabolism by at least 10% when
administered at a final concentration of up to 100 µg/ml to a culture of that
bacterium, as determined using one of the foregoing assays. Preferably, growth
or metabolism is decreased by at least 25%, more preferably by at least 50%,
and most preferably by at least 75%.
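
The inhibition criterion can be expressed as a short calculation, sketched here
in Python. The sketch assumes that growth or metabolism is read out as a single
number per culture (for example, an absorbance reading) and that the untreated
culture of the same strain serves as the reference; the function names and tier
labels are hypothetical.

```python
# Thresholds taken from the definition above, highest first.
TIERS = (
    (75.0, "most preferred (>= 75%)"),
    (50.0, "more preferred (>= 50%)"),
    (25.0, "preferred (>= 25%)"),
    (10.0, "inhibitory (>= 10%)"),
)


def percent_inhibition(treated_signal, untreated_signal):
    """Percent decrease of the treated culture relative to the untreated one."""
    if untreated_signal <= 0:
        raise ValueError("untreated signal must be positive")
    return 100.0 * (untreated_signal - treated_signal) / untreated_signal


def classify_test_substance(treated_signal, untreated_signal, concentration_ug_per_ml):
    """Label a result using the 10/25/50/75% tiers, for doses up to 100 ug/ml."""
    if concentration_ug_per_ml > 100.0:
        return "outside the stated concentration range"
    inhibition = percent_inhibition(treated_signal, untreated_signal)
    for threshold, label in TIERS:
        if inhibition >= threshold:
            return label
    return "not inhibitory"


if __name__ == "__main__":
    # Same mutant strain with and without the test substance (made-up readings).
    print(percent_inhibition(0.25, 0.5))
    print(classify_test_substance(0.25, 0.5, 50.0))
```

Because the reference is the same strain grown without the test substance, a
growth difference between the mutant and its parental strain does not distort
the result.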

The invention provides methods and reagents for (i) identifying MDR efflux
pumps, (ii) knocking out the function of one or more genes encoding MDR efflux
pumps in E. faecalis, (iii) using these mutated bacterial strains to identify
novel antibacterial agents, and (iv) using the novel genes encoding MDR efflux
pumps to identify compounds that increase the sensitivity of a bacterial cell
to an antibacterial agent. The gene disruption techniques are broadly
applicable for knocking out genes in a wide variety of bacteria, including, for
example, Staphylococcus aureus, Streptococcus pyogenes, Bacillus anthracis,
Clostridium tetani, Clostridium botulinum, Vibrio cholerae, Helicobacter
pylori, Salmonella typhimurium, Shigella dysenteriae, Bordetella pertussis,
Yersinia pestis, Pseudomonas aeruginosa, Neisseria gonorrhoeae, Mycobacterium
tuberculosis, Corynebacterium diphtheriae, Borrelia burgdorferi, Treponema
pallidum, Enterococcus faecalis, Enterococcus faecium, and Streptococcus
pneumoniae.

Other features and advantages of the invention will be apparent from 
the following description of the preferred embodiments thereof, and from the 
claims. 

Brief Description of the Drawings

Fig. 1 is a schematic illustration showing single gene knockout generation by
gene disruption. Bacterial cells in which the target gene has been disrupted
are isolated by selection in the presence of kanamycin. This method cannot be
used for the generation of multigene knockout strains because once the
kanamycin-bearing plasmid is inserted, a second kanamycin-bearing plasmid can
no longer be used for selection of another gene disruption. Additionally, the
plasmid backbone integrated into the genome provides an area for non-targeted
homologous recombination.

Figs. 2A-2F are a series of schematic illustrations showing that E. faecalis
MDR-knockout strains are more sensitive to antibacterial agents.

Fig. 3 is a schematic illustration showing gene knockout by disruption,
followed by marker and gene target deletion. Sequences A and B, which flank the
target gene, are amplified by PCR and cloned into a vector that provides
kanamycin resistance. The bacteria are transformed with this vector and
selected for kanamycin-resistant bacteria (thus selecting for bacteria in which
the vector has integrated into the chromosomal DNA due to a first crossover).
Subsequently, the cells are grown in the absence of kanamycin. During this
phase, if a second crossover occurs, the target gene and the kanamycin
resistance gene are each lost. Thus, the desired bacteria will die in the
presence of kanamycin. These kanamycin-sensitive cells can be identified by
replicate plating, as described herein.
Fig. 4 is a schematic illustration showing E. faecalis Abc1 nucleotide
sequence (SEQ ID NO: 1) and Abc1 amino acid sequence (SEQ ID NO: 2).

Fig. 5 is a schematic illustration showing E. faecalis Abc2 nucleotide
sequence (SEQ ID NO: 3) and Abc2 amino acid sequence (SEQ ID NO: 4).

Fig. 6 is a schematic illustration showing E. faecalis Abc3 nucleotide
sequence (SEQ ID NO: 5) and Abc3 amino acid sequence (SEQ ID NO: 6).

Fig. 7 is a schematic illustration showing E. faecalis Abc4 nucleotide
sequence (SEQ ID NO: 7) and Abc4 amino acid sequence (SEQ ID NO: 8).

Fig. 8 is a schematic illustration showing E. faecalis Abc5 nucleotide
sequence (SEQ ID NO: 9) and Abc5 amino acid sequence (SEQ ID NO: 10).

Fig. 9 is a schematic illustration showing E. faecalis Abc6 nucleotide
sequence (SEQ ID NO: 11) and Abc6 amino acid sequence (SEQ ID NO: 12).

Fig. 10 is a schematic illustration showing E. faecalis Abc7
nucleotide sequence (SEQ ID NO: 13) and Abc7 amino acid sequence (SEQ
ID NO: 14).

Fig. 11 is a schematic illustration showing E. faecalis Abc8
nucleotide sequence (SEQ ID NO: 15) and Abc8 amino acid sequence (SEQ
ID NO: 16).

Fig. 12 is a schematic illustration showing E. faecalis Abc9
nucleotide sequence (SEQ ID NO: 17) and Abc9 amino acid sequence (SEQ
ID NO: 18).

Fig. 13 is a schematic illustration showing E. faecalis Abc10
nucleotide sequence (SEQ ID NO: 19) and Abc10 amino acid sequence (SEQ
ID NO: 20).

Fig. 14 is a schematic illustration showing E. faecalis Abc11
nucleotide sequence (SEQ ID NO: 21) and Abc11 amino acid sequence (SEQ
ID NO: 22).

Fig. 15 is a schematic illustration showing E. faecalis Abc12
nucleotide sequence (SEQ ID NO: 23) and Abc12 amino acid sequence (SEQ
ID NO: 24).

Fig. 16 is a schematic illustration showing E. faecalis Abc13
nucleotide sequence (SEQ ID NO: 25) and Abc13 amino acid sequence (SEQ
ID NO: 26).

Fig. 17 is a schematic illustration showing E. faecalis Abc14
nucleotide sequence (SEQ ID NO: 27) and Abc14 amino acid sequence (SEQ
ID NO: 28).

Fig. 18 is a schematic illustration showing E. faecalis Abc15
nucleotide sequence (SEQ ID NO: 29) and Abc15 amino acid sequence (SEQ
ID NO: 30).

Fig. 19 is a schematic illustration showing E. faecalis Abc16
nucleotide sequence (SEQ ID NO: 31) and Abc16 amino acid sequence (SEQ
ID NO: 32).

Fig. 20 is a schematic illustration showing E. faecalis Abc17
nucleotide sequence (SEQ ID NO: 33) and Abc17 amino acid sequence (SEQ
ID NO: 34).

Fig. 21 is a schematic illustration showing E. faecalis Abc18
nucleotide sequence (SEQ ID NO: 35) and Abc18 amino acid sequence (SEQ
ID NO: 36).

Fig. 22 is a schematic illustration showing E. faecalis Abc19
nucleotide sequence (SEQ ID NO: 37) and Abc19 amino acid sequence (SEQ
ID NO: 38).






WO 01/79257 



PCT/US01/12230 



Fig. 23 is a schematic illustration showing E. faecalis Abc20
nucleotide sequence (SEQ ID NO: 39) and Abc20 amino acid sequence (SEQ
ID NO: 40).

Fig. 24 is a schematic illustration showing E. faecalis Abc21
nucleotide sequence (SEQ ID NO: 41) and Abc21 amino acid sequence (SEQ
ID NO: 42).

Fig. 25 is a schematic illustration showing E. faecalis Abc22
nucleotide sequence (SEQ ID NO: 43) and Abc22 amino acid sequence (SEQ
ID NO: 44).

Fig. 26 is a schematic illustration showing E. faecalis Abc23
nucleotide sequence (SEQ ID NO: 45) and Abc23 amino acid sequence (SEQ
ID NO: 46).

Fig. 27 is a schematic illustration showing E. faecalis Bmr
nucleotide sequence (SEQ ID NO: 47) and Bmr amino acid sequence (SEQ ID
NO: 48).

Fig. 28 is a schematic illustration showing E. faecalis NorA
nucleotide sequence (SEQ ID NO: 49) and NorA amino acid sequence (SEQ
ID NO: 50).

Fig. 29 is a schematic illustration showing E. faecalis Mf1
nucleotide sequence (SEQ ID NO: 51) and Mf1 amino acid sequence (SEQ ID
NO: 52).

Fig. 30 is a schematic illustration showing E. faecalis Mf2
nucleotide sequence (SEQ ID NO: 53) and Mf2 amino acid sequence (SEQ ID
NO: 54).

Fig. 31 is a schematic illustration showing E. faecalis Mf3
nucleotide sequence (SEQ ID NO: 55) and Mf3 amino acid sequence (SEQ ID
NO: 56).

Fig. 32 is a schematic illustration showing E. faecalis Mf4
nucleotide sequence (SEQ ID NO: 57) and Mf4 amino acid sequence (SEQ ID
NO: 58).

Fig. 33 is a schematic illustration showing E. faecalis Mf5
nucleotide sequence (SEQ ID NO: 59) and Mf5 amino acid sequence (SEQ ID
NO: 60).

Fig. 34 is a schematic illustration showing E. faecalis Mf6
nucleotide sequence (SEQ ID NO: 61) and Mf6 amino acid sequence (SEQ ID
NO: 62).

Fig. 35 is a schematic illustration showing E. faecalis Mf7
nucleotide sequence (SEQ ID NO: 63) and Mf7 amino acid sequence (SEQ ID
NO: 64).

Fig. 36 is a schematic illustration showing E. faecalis Mate1
nucleotide sequence (SEQ ID NO: 65) and Mate1 amino acid sequence (SEQ
ID NO: 66).

Fig. 37 is a schematic illustration showing E. faecalis Smr1
nucleotide sequence (SEQ ID NO: 67) and Smr1 amino acid sequence (SEQ
ID NO: 68).

Detailed Description 

We have invented a method for identifying MDR efflux pumps by
first mining bacterial genome databases to identify candidate pumps, and then
validating function using a disruption-mediated gene knockout strategy. Using
this method, we have discovered over thirty candidate MDR efflux pumps in
E. faecalis. Gene knockout experiments to date have verified that at least five
of these candidates do function as MDR efflux pumps.

We have also discovered methods for generating bacterial strains in
which two or more MDR efflux pumps have been mutated by gene deletion.
These bacterial strains are useful for screening test substances for their ability
to inhibit the growth or metabolism of bacterial cells.

Example 1: Genomic Sequence Database Mining to Identify Candidate
Drug-Efflux Pumps in E. faecalis

We used the DNA sequence and membrane topology of several
known MDR efflux pumps to probe, using a series of BLAST searches, for
homologies within the genomic sequence of E. faecalis (Worley et al.,
Genome Res. 5:173-184, 1995; see Table 1; the database can be accessed at
http://www.tigr.org). The amino acid sequences of known MDR efflux
pumps from other bacterial and fungal species were used as probes in BLAST
searches of the TIGR database (WU-BLAST version 2.0). A list of contigs
and their P-values was obtained. Contigs with P-values under the cut-off of
1e-20 were considered for further analysis. The ORFs from each contig were
located and examined for the presence of predicted transmembrane coding
regions using Sosui version 1.0/10 (Hirokawa et al., Bioinformatics
14:378-379, 1998). For the ABC family, the ORFs were also examined for ABC
signature sequences and ATP/GTP binding site motif A (P-loop) using Motif
finder (http://www.motif.genome.ad.jp/). ORFs that
satisfied the foregoing criteria were then used as probes in BLAST searches
against protein databases in order to identify ORFs having orthologues that are
known MDR efflux pumps (it is not necessary that an ORF have an orthologue
that is a known MDR efflux pump; this step aids in prioritizing candidate
MDR efflux pumps for testing).

Table 1

Organism          MDR Pump Gene    Transporter Family    Accession Number
S. aureus         NorA             MF                    P21191
B. subtilis       Bmr              MF                    P33449
B. subtilis       Blt              MF                    L32599
S. pneumoniae     PmrA             MF                    AJ007367
L. lactis         LmrP             MF                    X89779
L. lactis         LmrA             ABC                   U63741
S. epidermidis    MsrA             ABC                   X52085

The ABC Family

The members of the ABC family of proteins share an ATP binding
cassette sequence motif that enables the identification of E. faecalis sequences
having high identity on the amino acid level to L. lactis LmrA and S.
epidermidis MsrA (designated Abc1 through Abc23; Table 2).
Only a minority of ABC proteins are predicted to act as drug transporters, and,
at present, there is no easy way to distinguish MDR-ABC proteins from those
that perform any of the myriad of non-MDR related functions within the cell.
Thus, a second step of the method includes functional testing of the candidate
encoded polypeptides by generating disruption or deletion mutants (see
Examples 2 and 4).

The Multiple Facilitator Superfamily

We scanned the E. faecalis genomic database using as probes a set
of known MDR genes of the MF superfamily from a set of Gram-positive
bacteria (e.g., NorA, Bmr, Blt, PmrA; Table 1). We identified a set of
sequences having high identity to one or more of the probes; these sequences
were designated Mf1 through Mf7 (Table 2). The E. faecalis genes designated
NorA and Bmr were labeled such because of their high sequence identity to the
equivalent genes in S. aureus and B. subtilis, respectively (Table 1). Other MF
genes in E. faecalis are also highly similar to these genes. As with the ABC
homologues, the MF family of proteins most likely has a broad range of
functions, many of which are unrelated to MDR. Therefore, functional testing
by drug sensitivity is performed to definitively designate these proteins as
MDR efflux pumps (see below).

MATE and Smr Families

A family of transporters termed MATE (multidrug and toxic
compound extrusion transporters) has been described based on sequence
homology to the NorM multidrug efflux protein of V. parahaemolyticus and
the YdhE gene product of E. coli and Haemophilus (H.) influenzae (Morita et
al., Antimicrob. Agents Chemother. 42:1778-1782, 1998; Brown et al., Mol.
Microbiol. 31:394-395, 1999). The Smr family consists of proteins with four
transmembrane-spanning regions, and the MDR efflux proteins within this
family are the smallest known secondary transporters (Paulsen et al., Mol.
Microbiol. 19:1167-1175, 1996). Using the E. faecalis genomic database, we
scanned for potential members of the MATE transporter protein family with B.
subtilis YpnP, YisQ, and YucE, and H. influenzae YdhE, and for potential
members of the Smr transporter protein family with B. subtilis EbrA and EbrB.
By conducting orthologue searching of MDR efflux pump gene families,
we identified candidates for gene-knockout generation and drug sensitivity
analysis in E. faecalis (Table 2).

Table 2

E. faecalis sequence      Homologue    Organism              Accession #  E value†
identifier*

Abc1 (SEQ ID NO: 2)       Tap-tub2     M. tuberculosis       Q11047       9.20E-114
(Contig 6453)             YfiC         B. subtilis           P54719       4.80E-110
                          mdl          E. coli               P30751       2.50E-107
                          Tap-tub      M. tuberculosis       Q11046       3.50E-97
                          AtrD         A. nidulans           AF071411     1.80E-90
                          [illegible]  A. fumigatus          U62934       [illegible]
Abc2 (SEQ ID NO: 4)       Tap-tub2     M. tuberculosis       Q11047       4.70E-123
                          mdl          E. coli               P30751       [illegible]
                          YfiC         B. subtilis           P54719       [illegible]
                          AtrD         A. nidulans           AF071411     6.90E-106
                          YfiB         B. subtilis           P54718       9.30E-103
Abc3 (SEQ ID NO: 6)       YheI         B. subtilis           CAB12811     8.40E-106
(Contig 6222)             ComA         S. pneumoniae         M36180       6.30E-57
                          afuMDR1      A. fumigatus          U62934       1.50E-56
Abc4 (SEQ ID NO: 8)       Tap-tub2     M. tuberculosis       Q11047       1.70E-143
(Contig 6207)             YfiC         B. subtilis           P54719       2.80E-117
Abc5 (SEQ ID NO: 10)      Tap-tub      M. tuberculosis       Q11046       1.80E-115
(Contig 62[?]7)           mdl          E. coli               P30751       1.40E-104
                          AtrD         A. nidulans           AF071411     4.50E-103
                          Yfi[?]       B. subtilis           [illegible]  [illegible]
                          afuMDR1      A. fumigatus          U62934       4.90E-96
                          LmrA         L. lactis             U63741       5.20E-73
                          ComA         S. pneumoniae         M36180       8.40E-58
                          pfmdr1       P. falciparum         M29154       2.80E-56
Abc6 (SEQ ID NO: 12)      LmrA         L. lactis             U63741       6.00E-75
(Contig 6344)             ComA         S. pneumoniae         M36180       [illegible]
                          afuMDR1      A. fumigatus          U62934       5.60E-56
Abc7 (SEQ ID NO: 14)      AtrD         A. nidulans           AF071411     5.30E-106
(Contig 6226)             YfiC         B. subtilis           P54719       6.90E-105
                          afuMDR1      A. fumigatus          U62934       4.70E-103
                          YfiB         B. subtilis           P54718       6.80E-100
                          LmrA         L. lactis             U63741       1.70E-71
                          ComA         S. pneumoniae         M36180       1.40E-57
Abc8 (SEQ ID NO: 16)      mdl          E. coli               [illegible]  [illegible]
(Contig 6358)             AtrD         A. nidulans           AF071411     8.20E-115
                          YfiC         B. subtilis           P54719       5.80E-108
                          afuMDR1      A. fumigatus          U62934       1.20E-102
                          YheI         B. subtilis           CAB12811     4.90E-90
                          LmrA         L. lactis             U63741       3.10E-71
Abc9 (SEQ ID NO: 18)      SunT         B. subtilis           CAB14065     7.70E-58
(Contig 6217)
Abc10 (SEQ ID NO: 20)     cydC         H. influenzae         AAC22811     2.40E-51
(Contig 6328)
Abc11 (SEQ ID NO: 22)     Tap-tub2     M. tuberculosis       Q11047       5.60E-109
(Contig 6185)             mdl          E. coli               P30751       1.30E-97
                          AtrD         A. nidulans           AF071411     6.10E-97
                          Tap-tub      M. tuberculosis       Q11046       2.30E-87
                          afuMDR1      A. fumigatus          U62934       1.60E-85
Abc12 (SEQ ID NO: 24)     afuMDR1      A. fumigatus          U62934       3.30E-81
(Contig 6317)             AtrD         A. nidulans           AF071411     5.90E-81
                          pfmdr1       P. falciparum         M29154       4.30E-66
Abc13 (SEQ ID NO: 26)     SunT         B. subtilis           CAB14065     6.30E-21
(Contig 6285)
Abc14 (SEQ ID NO: 28)     mdl          E. coli               P30751       7.20E-92
(Contig 6864)
Abc15 (SEQ ID NO: 30)     StpC         S. aureus             Z30588       4.60E-27
(Contig 6183)             DrrA         S. peucetius          P32010       1.00E-25
Abc16 (SEQ ID NO: 32)     MsrA         S. epidermidis        X52085       3.10E-88
(Contig 6229)             SmrB         S. ambofaciens        X63451       8.20E-31
Abc17 (SEQ ID NO: 34)     SmrB         S. ambofaciens        X63451       7.10E-62
(Contig 6174)             TlrC         S. fradiae            M57437       8.00E-57
Abc18 (SEQ ID NO: 36)     SmrB         S. ambofaciens        X63451       5.90E-62
(Contig 6381)             TlrC         S. fradiae            M57437       2.40E-51
Abc19 (SEQ ID NO: 38)     TlrC         S. fradiae            M57437       1.00E-46
(Contig 6402)             SmrB         S. ambofaciens        X63451       1.80E-43
Abc20 (SEQ ID NO: 40)     DrrA         S. peucetius          P32010       2.50E-35
(Contig 6203)
Abc21 (SEQ ID NO: 42)     DrrA         S. peucetius          P32010       3.40E-24
(Contig 6237)
Abc22 (SEQ ID NO: 44)     SmrB         S. ambofaciens        X63451       5.70E-43
(Contig 6232)             TlrC         S. fradiae            M57437       5.00E-40
Abc23 (SEQ ID NO: 46)     MsrA         S. epidermidis        X52085       3.60E-39
(Contig 6274)
Bmr (SEQ ID NO: 48)       PmrA         S. pneumoniae         AJ007367     7.30E-104
(Contig 6187)
NorA (SEQ ID NO: 50)      NorA         S. aureus             SWP21191     1.00E-73
(Contig 6353)             Bmr          B. subtilis           SWP33449     1.60E-89
                          Blt          B. subtilis           EML32599     4.60E-92
Mf1 (SEQ ID NO: 52)       Ptr          S. pristinaespiralis  X84072       4.00E-59
(Contig [illegible])      LfrA         M. smegmatis          U40487       [illegible]
                          [illegible]  B. subtilis           [illegible]  [illegible]
                          [illegible]  S. aureus             [illegible]  [illegible]
                          [illegible]  S. typhimurium        [illegible]  [illegible]
                          [illegible]  E. coli               BAA16553     1.40E-39
                          Bmr3         B. subtilis           Z99105       [illegible]
Mf2 (SEQ ID NO: 54)       Bmr3         B. subtilis           Z99105       7.80E-62
(Contig 6226)             YfiU         B. subtilis           BAA24461     1.80E-46
                          [illegible]  E. coli               [illegible]  [illegible]
                          Mmr          B. subtilis           [illegible]  [illegible]
                          Ptr          S. pristinaespiralis  X84072       [illegible]
                          SmvA         S. typhimurium        P37594       1.40E-34
                          LfrA         M. smegmatis          U40487       1.40E-32
                          QacA         S. aureus             X56628       4.60E-30
Mf3 (SEQ ID NO: 56)       PmrA         S. pneumoniae         AJ007367     [illegible]
                          [illegible]  S. aureus             P21191       [illegible]
                          Blt          B. subtilis           L32599       [illegible]
                          Bmr          B. subtilis           [illegible]  [illegible]
Mf4 (SEQ ID NO: 58)       YcnB         B. subtilis           BAA09056     2.30E-129
                          [illegible]  B. subtilis           [illegible]  [illegible]
                          [illegible]  E. coli               [illegible]  [illegible]
                          Ptr          S. pristinaespiralis  [illegible]  [illegible]
                          Mmr          B. subtilis           [illegible]  [illegible]
                          [illegible]  [illegible]           Z99105       [illegible]
                          QacA         S. aureus             X56628       4.80E-30
Mf5 (SEQ ID NO: 60)       [illegible]  E. coli               [illegible]  [illegible]
                          Mmr          B. subtilis           [illegible]  [illegible]
                          [illegible]  B. subtilis           Z99105       [illegible]
                          [illegible]  S. aureus             [illegible]  [illegible]
                          LfrA         M. smegmatis          U40487       3.90E-31
                          Bcr          E. coli               PRJN0659     6.20E-15
Mf6 (SEQ ID NO: 62)       LmrB         B. subtilis           BAA22228     1.90E-67
(Contig 6248)             YcnB         B. subtilis           BAA09056     4.90E-54
                          Mmr          B. subtilis           Q00538       8.30E-29
                          QacA         S. aureus             X56628       1.80E-27
                          Bmr3         B. subtilis           Z99105       3.60E-26
Mf7 (SEQ ID NO: 64)       YdgK         B. subtilis           BAA09056     4.70E-28
(Contig 6473)             EmrD         E. coli               P31442       4.20E-27
Mate1 (SEQ ID NO: 66)     YpnP         B. subtilis           P54181       1.10E-13
(Contig 6563)             YucE         B. subtilis           CAB12921     3.90E-10
                          YdhE         H. influenzae         P45272       9.90E-08
                          YisQ         B. subtilis           CAA70645     6.00E-07
Smr1 (SEQ ID NO: 68)      EbrA         B. subtilis           CAB13614     2.60E-24
(Contig 6338)             EbrB         B. subtilis           CAB13613     3.70E-20

†The E-value relates to the Smallest Sum probability of the number of hits one can expect to see by
chance when searching a database of a certain size. The analysis was performed according to Gish and
States (Nature Genet. 3:266-272, 1993) and Worley et al. (Genome Res. 5:173-184, 1995).
*Denotes contig number in the TIGR database (http://www.tigr.org).

Example 2: Development of Tools to Generate Gene-Knockout Mutants 

The function of newly discovered genes can be studied by
gene-specific mutagenesis, commonly referred to as a gene-knockout, followed
by phenotypic characterization. Gene-knockout mutants in microorganisms are
usually generated by the following steps: (a) construction of a vector that
contains a portion of the target gene to be inactivated, (b) introduction into the
host organism of the knockout vector, (c) recombination of the knockout
vector into the target gene via homologous recombination such that integration
disables gene function, and (d) selection of the knockout mutant using a
dominant drug selection marker (e.g., kanamycin). The aforementioned
strategy has been used widely in bacteria and fungi and is well developed for
use in the model Gram-negative bacterium, E. coli, and Gram-positive
bacterium, B. subtilis, respectively (Link et al., J. Bacteriol. 179:6228-6237,
1997; Vagner et al., Microbiology 144:3097-3104, 1999).

Prior to the present invention, knockout mutations in E. faecalis
were generated by conjugation and transposition (see, for example, Teng et al.,
Plasmid 39:182-186, 1998) or, alternatively, by insertional single gene
knockouts (Qin et al., Antimicrob. Agents Chemother. 42:2883-2888, 1998).
Using these methods, predominantly insertional mutations were introduced. In
this case, there is no way to remove the antibiotic selection marker
without reconstituting the target gene of interest. Thus, the disruption methods
cannot be adapted for the generation of multigenic knockouts.

In order to functionally identify the genes responsible for MDR
efflux in E. faecalis, we developed a method for disrupting genes by
homologous recombination. To facilitate the performing of this method, we
developed gene-disruption plasmid vectors and transformation and selection
methods suitable for use in E. faecalis. The single-gene disruption strategy is
illustrated in Fig. 1. Briefly, a fragment of the target gene is generated from
the E. faecalis genomic DNA by PCR using primers based on the identified
gene sequence. Generally, the optimal size of the partial-gene fragment is
approximately 800 bp. Although each primer pair and PCR reaction required
optimization to generate the gene fragment with fidelity, we successfully
cloned a very significant proportion of the genes listed in Table 2, including
Abc1, Abc2, Abc3, Abc4, Abc5, Abc6, Abc7, Abc8, Abc8, Abc10, Abc11,
Abc13, Abc14, Abc15, Abc16, Abc17, Abc20, Abc22, Abc23, Mf1, Mf2,
Mf3, Mf4, Mf5, Mf6, Mf7, and Mate1. PCR-generated gene fragments are
then cloned into the BamHI site of the pBS/Kan plasmid vector and
transformed into E. coli for selection on kanamycin. Each vector construction
is validated by restriction digest mapping using standard methods. The
knockout vectors are each individually introduced into E. faecalis by
electroporation into the wild-type strain OG1RF (see below for protocol).
Since the pBS/Kan vector is designed for replication in E. coli and does not
contain a Gram-positive origin of replication, it fails to autonomously replicate
in Gram-positive bacteria (Teng et al., Plasmid 39:182-186, 1998). Hence, E.
faecalis transformants selected on kanamycin are presumed to have the vector
integrated into the chromosome, as illustrated in Fig. 1.

We developed methods to introduce the gene-disruption vectors into
E. faecalis to generate gene-disruption mutants. Briefly, electrocompetent
cells were prepared from a 1 liter culture of strain OG1RF grown in Brain
Heart Infusion (BHI) medium as follows. Cells were grown until late log
phase, recovered by centrifugation, and the pellet washed in sterile 10%
glycerol. After four washing steps, the cells were resuspended in 2 ml sterile
10% glycerol in water and stored at -70°C. Electroporation was performed
with 3 µg DNA mixed with 50 µl cells in a 0.1 cm cuvette. An Invitrogen
Electroporator was used with settings at 50 µF, 200 Ohm and 1.25 kV/cm.
One milliliter Todd Hewitt (TH) broth was added immediately after pulsing
and the cells were incubated at 37°C with agitation at 225 rpm. Transformants
were selected on BHI agar containing kanamycin at 2 mg/ml and observed
after 24 hours of growth. Transformation efficiency was consistently 1-6
transformants per 3 µg of DNA.

The genomic sequence in the TIGR database is derived from strain
V583. Since strain V583 is a vancomycin-resistant clinical isolate, we chose
for safety reasons to use the vancomycin-sensitive strain, OG1RF, for all our
laboratory manipulations. There are small sequence differences between the
strains, and PCR analysis indicates that OG1RF lacks the genes encoding Abc9,
Abc12 and Smr1.

The putative MDR-knockout mutants were examined individually
utilizing a PCR-based strategy to demonstrate that integration of the knockout
plasmid had occurred at the targeted site via homologous recombination. The
results of this process verified that the integration events were targeted at the
appropriate site in a very high proportion of the transformants examined. We
generated a large panel of gene knockout mutants using the techniques
described herein.

Example 3: Methods for Determining Growth Inhibition

One can readily test bacteria for growth inhibition by test substances
(e.g., natural product extracts) using visual observation (MIC determination) or
turbidity (absorbance at 650 nm; IC50 determination). Growth inhibition is
calculated as the percentage decrease in absorbance compared to the positive
control cells (e.g., the same bacterial strain in the absence of any test
substance). We have developed a 96-well microplate-based assay that is
suitable for drug-susceptibility testing of E. faecalis. In this assay, E. faecalis
is inoculated into BHI broth at an OD of 0.08 to 0.1. This stock cell suspension
is diluted 1:300 into BHI medium, and 98 µl of the cell suspension is added to
2 µl of antibacterial agent in a microtiter plate and grown overnight at 37°C.
When the disruption mutants are being tested, the appropriate antibiotic (2
mg/ml kanamycin) is added into the BHI medium. Turbidity readings are
measured at OD 650 nm and used to calculate the IC50 values. MICs are also
recorded at this time.
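
The arithmetic behind this assay can be sketched in a few lines of Python. The code below is illustrative only. The function names and the optional blank-well correction are assumptions, and the log-linear interpolation used to estimate the IC50 is one common approach that is assumed here, since the text does not state how the IC50 was derived from the turbidity readings.

```python
import math
from typing import Optional, Sequence


def percent_inhibition(od_treated: float, od_control: float,
                       od_blank: float = 0.0) -> float:
    """Percentage decrease in absorbance relative to the untreated control."""
    control = od_control - od_blank
    if control <= 0:
        raise ValueError("control absorbance must exceed the blank")
    return 100.0 * (1.0 - (od_treated - od_blank) / control)


def final_concentration(stock: float, drug_ul: float = 2.0,
                        cells_ul: float = 98.0) -> float:
    """Concentration in the well after adding drug_ul of stock to cells_ul of cells."""
    return stock * drug_ul / (drug_ul + cells_ul)


def ic50_by_interpolation(concs: Sequence[float],
                          inhibitions: Sequence[float]) -> Optional[float]:
    """Estimate the IC50 by interpolating on log10(concentration) between the
    two measured points that bracket 50% inhibition (assumed method)."""
    points = sorted(zip(concs, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo < 50.0 <= i_hi:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_ic50
    return None  # 50% inhibition was not bracketed by the data
```

With the 2 µl plus 98 µl format described above, the agent is diluted 50-fold in the well, so a stock must be prepared at 50 times the desired final concentration.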

An example of the foregoing method now follows, in which
single-gene disruption mutants were contacted with one of more than 40
different antimicrobial agents from disparate chemical classes. A large
proportion of the single-gene disruption mutants did not exhibit increased
sensitivity in comparison with the wild-type strain, OG1RF. In S. cerevisiae,
it is known that several MDR pumps have overlapping sets of substrate
specificities so that single-MDR-gene disruption strains may exhibit only
marginal drug sensitivity. In contrast, multiple-MDR-gene knockout strains
exhibit markedly increased hypersensitivity as compared to the respective
single-MDR-gene disruption strains
(Mahe et al., J. Biol. Chem. 271:25167-25172, 1996; Fleckenstein et al., Yeast
15:133-137, 1999). We now demonstrate that multiple-MDR-gene disruption
strains of E. faecalis also exhibit increased hypersensitivity as compared to
single MDR-gene knockout mutants (see Example 4). Moreover, using the
single-gene disruption mutants, several MDR efflux pumps were identified in
E. faecalis along with their spectrum of drug substrates (Table 3). These MDR
efflux pumps required functional analysis for unequivocal identification since,
in most cases, sequence comparison analysis alone was
inadequate. For example, Abc7, Abc16, and Abc23 are not particularly distinct
from the numerous other ABC genes (e.g., B. subtilis NatA; Cheng et al., Mol.
Microbiol. 23:1107-1120, 1997), and the sequence similarity alone gives no
indication that these transporters are MDR efflux pumps (Table 2).

The single-gene MDR-disruption strains were tested against known
antibacterial agents at multiple drug concentrations to define the MIC (Table
3). A subset of the single-gene MDR-disruption strains that exhibited drug
sensitivity was selected for more detailed analysis and the dose-response curve
and IC50 were determined for each. A comparison of the growth of
gene-disruption strains ΔNorA, ΔAbc7, ΔAbc16 and ΔAbc23 versus the
wild-type strain in the presence and absence of antibacterial agents is shown in
Fig. 2.

Table 3

Gene            Drug                MIC (µg/ml)                WT/Δgene ratio
knockout                            Wild type    Δgene

ΔAbc7           Doxorubicin         >100         [illegible]   64
                Daunorubicin        25           3.12          8
                Ethidium Bromide    25           6.25          4
                Ofloxacin           12.5         3.12          4
                Erythromycin        1.56         0.78          2
                Gentamicin          50           25            2
                Mithramycin         0.2          0.1           2
                Norfloxacin         6.25         3.12          2
                Novobiocin          12.5         6.25          2
                Sanguinarine        12.5         6.25          2
ΔAbc16          Erythromycin        1.56         0.1           16
                Daunorubicin        25           12.5          2
                Sanguinarine        12.5         6.25          2
ΔAbc23          Clindamycin         25           0.05          500
                Lincomycin          50           0.1           500
                Virginiamycin       100          1.56          64
                Synercid            25           0.39          64
ΔNorA           Acriflavin          12.5         6.25          2
                Ciprofloxacin       1.56         0.78          2
                Clindamycin         50           25            2
                Erythromycin        1.56         0.78          2
                Norfloxacin         6.25         3.12          2
                Novobiocin          12.5         6.25          2
ΔBmr            Clindamycin         50           25            2
                Erythromycin        1.56         0.78          2
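
The WT/Δgene ratio reported in Table 3 is the wild-type MIC divided by the MIC of the disruption strain. The short Python sketch below reproduces that arithmetic. The function names are hypothetical, and expressing the shift as a number of twofold dilution steps is an added convenience rather than something stated in the text.

```python
import math


def mic_ratio(mic_wild_type: float, mic_mutant: float) -> float:
    """Fold increase in sensitivity: wild-type MIC divided by mutant MIC."""
    if mic_wild_type <= 0 or mic_mutant <= 0:
        raise ValueError("MIC values must be positive")
    return mic_wild_type / mic_mutant


def twofold_steps(mic_wild_type: float, mic_mutant: float) -> int:
    """Number of twofold dilution steps separating the two MICs."""
    return round(math.log2(mic_ratio(mic_wild_type, mic_mutant)))
```

For example, the daunorubicin values read from Table 3 for the first strain (25 and 3.12 µg/ml) give a ratio that rounds to 8, which corresponds to three twofold dilution steps.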



Example 4: Generation of Multiple Gene-Knockouts in a Single Strain 

The single-gene disruption strategy described in Example 2
precludes the iterative use of the pBS/Kan vector to generate multiple insertion
mutations in a single strain because the kanamycin selectable marker remains
present following the first knockout. Furthermore, the re-use of the
same plasmid backbone would likely result in homologous recombination with
any subsequently introduced plasmid with shared sequence. To circumvent the
foregoing problems, we developed an alternative strategy
for the generation of multiple site-specific disruption mutations in a single
strain. We developed molecular tools to enable the facile generation of
site-directed multiple-knockout mutations utilizing an iterative process of (i)
plasmid insertion and selection followed by (ii) screening for excision and
target-gene deletion. The plasmid vectors and gene-deletion scheme are
illustrated in Fig. 3 and described in detail below.

Targeted in-frame deletions are constructed in three steps. In the
first step, a vector is constructed that contains sequences flanking the region to
be deleted. In the second step, this vector is inserted into the bacterial
chromosome by homologous recombination (referred to as a crossover) with
one of these flanking regions (sequence A or sequence B in Fig. 3). Bacteria
in which the first crossover has occurred are selected for their resistance to
kanamycin. In the third step, a second crossover occurs with the other flanking
region, resulting in loss of the inserted vector (including the gene conferring
resistance to kanamycin) and of the target gene in the chromosome (Fig. 3).
These three steps are described in more detail below.

Step 1. Construction of a Vector with Flanking Homologies by Crossover PCR

Primers are designed that amplify regions flanking the gene of
interest. PCR products of both flanking regions are preferably at least 500 bp,
and, even more preferably, at least 1 kb. The PCR products should contain bases
that overlap between them so that they can hybridize in the second round of
PCR, plus an integral number of codons from the gene of interest to prevent
polarity in the deletion. In one example, the following bases are added to the
3' primer on the N-terminal end of the gene:
5'-CCCATCCACTAAACTTAAACA-3' (SEQ ID NO: 69), while the
following bases are added to the 5' primer on the C-terminal end of the gene:
5'-TGTTTAAGTTTAGTGGATGGG-3' (SEQ ID NO: 70). During the
first round of PCR, two products will be made, one for each flanking region.
The products are isolated on a gel to select the correct sized product.

During a second PCR step, 1 µl of each flanking region product is
combined and used as a template. In this reaction, only the outside primers are
used (i.e., the 5' primer from the amplification of the upstream fragment and
the 3' primer from the amplification of the downstream fragment), and at a
10-fold higher concentration of each. The templates anneal to each other by
their complementary bases and, thus, the outside primers will amplify the
combined product (sequences A and B joined together). The product is
separated on a gel, the correct-sized band is excised, and the isolated DNA is
ligated into the vector.
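
The two design constraints of crossover PCR described in Step 1 (overlap tags that can anneal to each other, and a deletion that removes a whole number of codons) can be checked with a short script. The Python sketch below is illustrative: the function names are hypothetical, and it assumes that SEQ ID NO: 70 reads 5'-TGTTTAAGTTTAGTGGATGGG-3', which is the exact reverse complement of SEQ ID NO: 69.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

TAG_UPSTREAM = "CCCATCCACTAAACTTAAACA"    # SEQ ID NO: 69
TAG_DOWNSTREAM = "TGTTTAAGTTTAGTGGATGGG"  # SEQ ID NO: 70 as read here (assumption)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def tags_can_anneal(tag_a: str, tag_b: str) -> bool:
    """True if the two primer tails are exact reverse complements."""
    return reverse_complement(tag_a) == tag_b.upper()


def deletion_is_in_frame(deleted_nt: int) -> bool:
    """True if the number of deleted coding nucleotides is a multiple of 3."""
    if deleted_nt < 0:
        raise ValueError("deleted length cannot be negative")
    return deleted_nt % 3 == 0
```

Because the tails are reverse complements, the two first-round products share a 21-bp overlap, which lets them prime on each other in the second round so that the outside primers amplify the joined fragment.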

Step 2. Insertion into the Chromosome

The completed vector, including the insert, is transformed into 
bacterial cells using the standard electroporation protocol with selection for 
kanamycin resistance. PCR and/or Southern blot may be used to verify 
targeted integration of this construct into the chromosome. 

Step 3. Isolation of Deletion Mutants

This protocol is based on replica plating of the strain with the
inserted construct. In one example, cultures are first grown overnight in BHI
at 37°C until they reach stationary phase. The overnight culture is diluted
100-fold in fresh media and grown at 37°C for an additional 90 minutes. One
hundred microliters from a 5- to 10-fold dilution is plated to obtain
approximately 200 colonies on a TH agar plate (per liter: 30 g TH broth, 15 g
agar). The colonies are transferred from the original plate onto a piece of
sterile velvet cloth using a replica plating block and a metal clamp which has
been pre-marked on one side for orientation. The cells are transferred from the
velvet cloth, first onto a TH agar plate containing the required antibiotic
marker and then to a plain TH agar plate. All three plates are marked on the
corresponding sides to align them with the marking on the metal clamp. The
plates are then incubated at 37°C for 16 hours.

The marking on the TH agar plate is aligned with the marking on the 
TH agar selection plate and colonies growing only on the TH agar plate and 
not on the selection plate are identified. These cells may be plated onto fresh
TH agar plates as well as TH agar selection plates for retest. Colonies 
that grow only on TH agar and not on the selection plates are chosen and 
checked for the specific deletion by PCR and Southern blot hybridization. 
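
The comparison of the two replica plates amounts to a set difference over colony positions. A minimal sketch follows, assuming (hypothetically) that each colony has been recorded as a grid coordinate after the plates were aligned by their markings; the coordinates shown are invented.

```python
def candidate_deletion_colonies(plain_plate, selection_plate):
    """Return positions that grew on plain TH agar but not on the selection plate.

    Loss of the resistance marker is what the second crossover is expected
    to produce, so these positions are the candidates to retest.
    """
    return sorted(set(plain_plate) - set(selection_plate))


# Hypothetical colony positions recorded from aligned replica plates.
plain = {(1, 1), (1, 2), (2, 3), (4, 4)}
selective = {(1, 1), (2, 3)}

candidates = candidate_deletion_colonies(plain, selective)
```

Candidates found this way are only presumptive; the text still calls for retesting and for confirmation by PCR and Southern blot hybridization.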

In the case of an MDR gene deletion mutant, the drug sensitivity 
phenotype is verified by the determination of drug MIC and/or IC50 in
comparison to that of the wild-type parent strain. 
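
One simple way to express this comparison is as a fold decrease in MIC relative to the parent strain. The sketch below is an assumption-laden illustration: the function names, the two-fold cutoff, and the example MIC values are hypothetical and are not taken from the text.

```python
def mic_fold_decrease(mic_wild_type: float, mic_mutant: float) -> float:
    """Fold decrease in MIC of the mutant relative to the wild-type parent."""
    if mic_wild_type <= 0 or mic_mutant <= 0:
        raise ValueError("MIC values must be positive")
    return mic_wild_type / mic_mutant


def is_more_sensitive(mic_wild_type: float, mic_mutant: float,
                      min_fold: float = 2.0) -> bool:
    """Flag a mutant whose MIC dropped by at least `min_fold` (assumed cutoff)."""
    return mic_fold_decrease(mic_wild_type, mic_mutant) >= min_fold


# Hypothetical MICs in the same units for parent and deletion mutant.
fold = mic_fold_decrease(8.0, 1.0)
flagged = is_more_sensitive(8.0, 1.0)
```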

The vectors and methodology for generating multiple MDR
knockouts were used to generate a mutant with a deletion in the NorA gene.
This ΔNorA mutant can serve as a recipient in transformation experiments
using all the single-gene knockout constructs listed in Table 2. Hence, using
the methods developed herein we can now rapidly generate two-gene MDR
knockout mutants. We have generated such mutants (ΔNorAΔAbc1,
ΔNorAΔAbc2, ΔNorAΔAbc5, ΔNorAΔAbc6, ΔNorAΔAbc7, ΔNorAΔAbc8,
ΔNorAΔAbc10, ΔNorAΔAbc13, ΔNorAΔAbc16, ΔNorAΔAbc17,
ΔNorAΔAbc22, ΔNorAΔAbc23, ΔNorAΔMf2, ΔNorAΔMf3, ΔNorAΔMfc,
ΔNorAΔj7, and ΔNorAΔMate1). Significantly, the ΔNorAΔAbc16 double
mutant has the predicted phenotype since it exhibits increased dual sensitivity
to norfloxacin and erythromycin (attributed to the combination of ΔNorA and
ΔAbc16, respectively). The ΔNorAΔAbc7 and ΔNorAΔAbc23 double mutants
also exhibit increased drug sensitivity.

The foregoing method can be repeated an unlimited number of 
times, thus allowing for the generation of multigene-knockout bacteria (e.g., E. 
faecalis). These bacteria are, in turn, useful for screening of test substances for
the identification of novel antibacterial agents, as described in Example 5. The
method is readily adaptable to any bacterium in which homologous 
recombination can occur. 

Example 5: Use of Multi-MDR Gene-knockout Bacteria for Identification of
Novel Antibacterial Agents

Bacterial strains having mutations in two or more MDR efflux 
pumps are likely to be hypersensitive to antibacterial agents. Accordingly, 
these strains are useful in assays to identify novel antibacterial compounds.
Methods of making multi-MDR gene disrupted bacteria and methods for
measuring bacterial cell growth or metabolism are described herein. In the
screening assays of the present invention, test substances are contacted with the
bacterial strains for a period of time sufficient to detect any inhibition of
growth or metabolism. If the test substance that inhibits growth or metabolism
of a bacterial cell is a collection of compounds, the test substance can be
fractionated using any standard technique to further characterize the active
compound. If the test substance is a substantially pure compound, this
compound can be chemically derivatized in order to attempt to find a related
molecule that has greater activity or that is pumped less efficiently. The assay
can be adapted for high throughput screening. Methods suitable for high
throughput are described herein.
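
For a plate-based screen, inhibition of growth is commonly summarized as a percentage relative to an untreated control. The following sketch assumes optical density readings as the growth measure; the readout, the blank correction, and the example values are illustrative assumptions rather than details given in the text.

```python
def percent_inhibition(od_treated: float, od_untreated: float,
                       od_blank: float = 0.0) -> float:
    """Percent growth inhibition of a treated well versus an untreated control.

    All three readings are assumed to be optical densities from the same
    plate; `od_blank` is the reading of sterile medium.
    """
    growth_untreated = od_untreated - od_blank
    if growth_untreated <= 0:
        raise ValueError("untreated control shows no growth above blank")
    growth_treated = od_treated - od_blank
    return 100.0 * (1.0 - growth_treated / growth_untreated)


# Hypothetical readings for a multi-MDR knockout strain with a test substance.
inhibition = percent_inhibition(od_treated=0.25, od_untreated=0.85, od_blank=0.05)
```

Running the same calculation on the wild-type parent gives a direct view of how much the knockout background increases sensitivity to a given test substance.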

Example 6: MDR Efflux Pumps as Drug Targets 

MDR efflux pumps in bacterial cells decrease the sensitivity of the 
cells to antibacterial agents by pumping the agents out of the cells. Thus, a 
compound that blocks efflux of the agent or decreases expression of the pump 
will make bacterial cells more sensitive to antibacterial agents. The MDR
efflux pumps described herein are accordingly novel targets for these
compounds.

Test substances can be screened for their ability to decrease drug 
efflux. In one example, an E. faecalis MDR efflux pump is contacted with a 
test substance, and the ability of a compound in the test substance to bind to 
the polypeptide is then determined. Any of a number of binding assays known
in the art can be performed. In another example, the ability of a test substance
to decrease expression of an MDR efflux pump is determined. The method
includes the steps of: (a) providing a cell expressing the MDR efflux pump; (b)
contacting the cell with the test substance; and (c) measuring expression of the
MDR efflux pump in the cell. Decreased polypeptide expression, relative to a
cell not contacted with the test substance, indicates that the test substance
decreases expression of an MDR efflux pump. Expression of the MDR efflux
pump can be determined by measuring protein levels of the MDR efflux pump
or by measuring levels of RNA encoding the MDR efflux pump. Finally,
efflux of an antibacterial agent by an MDR efflux pump of the present
invention can be determined in whole cells in the presence and absence of a
test substance. In this assay, decreased efflux in the presence of the test 
substance identifies the test substance as containing a compound that blocks 
efflux through the MDR efflux pump. 

In a further example, an E. faecalis gene encoding an MDR efflux
pump is introduced into a surrogate bacterial host (e.g., B. subtilis, L. lactis)
that lacks the MDR gene. The ability of a compound in the test substance to
block efflux of an antibacterial agent by the MDR efflux pump from the
surrogate bacterial host can be ascertained by measuring sensitivity to the
antibacterial agent as measured by cell growth. Hence, the MDR efflux pump
can be expressed in a heterologous bacterial expression host such that the
introduced MDR efflux pump is the only E. faecalis-derived gene expressed in
the surrogate bacterial host.

Compounds identified as blocking MDR pump efflux can be used to
increase the sensitivity of an E. faecalis cell to an antibacterial agent. For
example, the cell can be contacted with a compound that blocks efflux of the
antibacterial agent from the cell by binding to an MDR efflux pump.
Alternatively, the cell can be contacted with a compound that blocks efflux of
the antibacterial agent from the cell by decreasing the expression of an MDR
efflux pump.

Example 7: Expression of bacterial MDR efflux pumps in non-bacterial cells

The MDR efflux pumps of the present invention can be expressed in
non-bacterial cells such as yeast cells or mammalian cells (e.g., human cells) in
order to screen for compounds that block their efflux of antibacterial agents.
For example, the LmrA MDR gene from L. lactis has been functionally
expressed in GM0637 human lung fibroblast cells (van Veen et al., Nature
391:291-295, 1998). The lung fibroblast cells expressing the LmrA protein
showed a 10-60 fold increased resistance to a variety of natural product drugs
and synthetic chemotherapeutic drugs, which are typical substrates of human
P-glycoprotein. The functional complementation of the human MDR
P-glycoprotein with an expressed bacterial MDR gene allows the use of
numerous methods for the discovery and characterization of agents that inhibit
bacterial MDR efflux in mammalian-cell based assays (reviewed in Sharom, J.
Memb. Biol. 160:161-175, 1997 and Stein, Physiol. Rev. 77:545-590, 1997,
each of which is hereby incorporated by reference). These approaches involve
various assay formats for the measurement of the modulation by MDR
"reversal agents" or blockers of compound efflux mediated by MDR pumps.
Examples include the measurement, in the presence and absence of a test
substance, of: (i) cytotoxicity mediated by an MDR-pump substrate (e.g.
doxorubicin); (ii) cellular uptake of a radiolabeled MDR-pump substrate (e.g.
[3H]daunorubicin); and (iii) accumulation of a fluorescent MDR-pump
substrate (e.g. Rhodamine 123) (Bosch et al., Leukemia 11:1131-1137, 1997).
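
For the accumulation format in (iii), one way to summarize a result is as the ratio of cell-associated signal with and without the test substance. The sketch below is hypothetical throughout: the function names, the 1.5-fold cutoff, and the fluorescence values are assumptions chosen for illustration.

```python
def accumulation_ratio(signal_with: float, signal_without: float) -> float:
    """Ratio of substrate signal in cells with versus without test substance."""
    if signal_without <= 0:
        raise ValueError("control signal must be positive")
    return signal_with / signal_without


def suggests_efflux_block(signal_with: float, signal_without: float,
                          min_ratio: float = 1.5) -> bool:
    """True when accumulation rises by at least `min_ratio` (assumed cutoff).

    Higher intracellular substrate in the presence of the test substance is
    the pattern expected when pump-mediated efflux is reduced.
    """
    return accumulation_ratio(signal_with, signal_without) >= min_ratio


# Hypothetical fluorescence readings (arbitrary units) for pump-expressing cells.
ratio = accumulation_ratio(900.0, 300.0)
hit = suggests_efflux_block(900.0, 300.0)
```

A matched measurement in cells that do not express the pump would help separate pump-specific effects from general changes in uptake.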

In another example, the bacterial MDR gene, EmrE from E. coli, has 
been expressed in the yeast Saccharomyces cerevisiae and shown to confer 
resistance to a wide variety of drugs, including acriflavin, ethidium, and methyl
viologen (Yelin et al., J. Bacteriol. 181:949-956, 1999). Although the EmrE
protein pumped drugs into the vacuolar compartment rather than into the
exterior of the cell, this report further indicates the feasibility of expressing
bacterial MDR genes in a non-bacterial background. As with the mammalian
cell-based MDR expression system described previously, lower-eukaryote
heterologous expression hosts can also be used as the basis for the discovery of
MDR inhibition agents. 

Example 8: Cell-free assays for the identification of compounds that block 
MDR efflux pumps 

In addition to performing screening assays in non-bacterial cells,
assays can also be performed using cell-free systems. The bacterial MDR LmrP
protein from L. lactis has been purified, reconstituted in
dodecylmaltoside-destabilized preformed liposomes, and shown to mediate the
transport of multiple drugs in response to an artificially-imposed pH gradient
(Putman et al., Biochemistry 38:1002-1008, 1999). These studies demonstrate the
applicability of using cell-free systems for the discovery of blockers of drug
efflux mediated by MDR proteins. The transport assays described above (e.g.,
using radiolabeled [3H]daunorubicin or Rhodamine 123) can be readily
adapted to the cell-free assays.

Test Substances

In general, test substances and compounds are identified from large 
libraries of both natural product and synthetic (or semi-synthetic) extracts or 
chemical libraries according to methods known in the art. Those skilled in the
field of drug discovery and development will understand that the precise
source of the test substance is not critical to the screening procedure(s) of the
invention. Accordingly, virtually any number of chemical extracts or
compounds can be screened using the methods described herein. Examples of
such extracts or compounds include, but are not limited to, plant-, fungal-,
prokaryotic- or animal-based extracts, fermentation broths, and synthetic
compounds, as well as modification of existing compounds. Numerous
methods are also available for generating random or directed synthesis (e.g.,
semi-synthesis or total synthesis) of any number of chemical compounds,
including, but not limited to, saccharide-, lipid-, peptide-, and nucleic
acid-based compounds. In addition, natural and synthetically produced libraries are
produced, if desired, according to methods known in the art, e.g., by standard 
extraction, fractionation, or synthetic methods. Furthermore, if desired, any 
library or compound is readily modified using standard chemical, physical, or 
biochemical methods. 

In addition, those skilled in the art of drug discovery and
development readily understand that methods for dereplication (e.g.,
taxonomic dereplication, biological dereplication, and chemical dereplication,
or any combination thereof) or the elimination of replicates or repeats of
materials already known for their anti-pathogenic activity should be employed
whenever possible.

When a test substance, as a mixture, is found to inhibit growth or 
metabolism of a bacterial cell, further fractionation of the positive lead extract 
may be necessary to isolate chemical constituents responsible for the observed
effect. Thus, the goal of the extraction, fractionation, and purification process
is the careful characterization and identification of a chemical entity having the
desired activity within the test substance. Methods of fractionation and
purification of test substances are known in the art. If desired, test substances
shown to be useful agents for the inhibition of bacterial growth or metabolism
are chemically modified according to methods known in the art.

Pharmaceutical Therapeutics and Plant Protectants 

The invention provides a simple means for identifying compounds 
(including peptides, small molecule inhibitors, and mimetics) capable of
inhibiting bacterial growth or metabolism. Accordingly, antibacterial agents
discovered to have medicinal or agricultural value using the methods described
herein are useful as either drugs, plant protectants, or as information for
structural modification of existing anti-bacterial compounds, e.g., by rational
drug design. Such methods are useful for screening compounds having an
effect on a variety of bacteria.

For therapeutic uses, the compositions or agents identified using the 
methods disclosed herein may be administered systemically, for example, 
formulated in a pharmaceutically-acceptable buffer such as physiological 
saline. Suitable routes of administration include, for example, orally, by
inhalation, or by subcutaneous, intravenous, intraperitoneal, intramuscular, or
intradermal injections which provide continuous, sustained levels of the drug
in the patient. Treatment of human patients or other animals will be carried out
using a therapeutically effective amount of an anti-bacterial agent in a
physiologically-acceptable carrier. The amount of the anti-bacterial agent to be
administered varies depending upon the manner of administration, the age and
body weight of the patient, and with the type of disease and extensiveness of
the disease. Generally, amounts will be in the range of those used for other
agents used in the treatment of other bacterial diseases, although in certain
instances lower amounts will be needed because of the increased specificity of
the compound. A compound is administered at a dosage that inhibits bacterial
growth or metabolism. For example, for systemic administration a compound
is administered typically in the range of 0.1 µg to 10 g/kg body weight.

In addition, the antibacterial agent may be added to materials used to 
make catheters, including but not limited to intravenous, urinary, 
intraperitoneal, ventricular, spinal and surgical drainage catheters, in order to 
prevent colonization and systemic seeding by potential pathogens. Similarly, 
the antibacterial agent may be added to the materials that constitute various
surgical prostheses and to dentures to prevent colonization by pathogens and 
thereby prevent more serious invasive infection or systemic seeding by 
pathogens. 

For agricultural uses, the compositions or agents identified using the 
methods disclosed herein may be used as chemicals applied as sprays or dusts
on the foliage of plants, or in irrigation systems. Typically, such agents are to
be administered on the surface of the plant in advance of the pathogen in order
to prevent infection. Seeds, bulbs, roots, tubers, and corms are also treated to
prevent pathogenic attack after planting by controlling pathogens carried on
them or existing in the soil at the planting site. Soil to be planted with
vegetables, ornamentals, shrubs, or trees can also be treated with chemical
fumigants for control of a variety of bacterial pathogens. Treatment can be
done several days or weeks before planting. The chemicals can be applied by
either a mechanized route, e.g., a tractor, or with hand applications. In addition,
chemicals identified using the methods of the assay can be used as
disinfectants.

Other Embodiments

The present invention has been described in terms of particular 
embodiments found or proposed by the present inventors to include exemplary 
modes for the practice of the invention. It will be appreciated by those of skill 
in the art that, in light of the present disclosure, numerous modifications and 
changes can be made in the particular embodiments exemplified without 
departing from the intended scope of the invention. All such modifications are 
intended to be included within the scope of the invention. 

What we claim is:

1. A method of determining whether a candidate nucleotide
sequence encodes an MDR efflux pump, said method comprising the steps of:

(a) searching a database of nucleotide sequences for sequences
having high identity to a sequence encoding a known MDR efflux pump to
generate a first set of one or more candidate sequences comprising sequences
that have high identity to the sequence encoding the known MDR efflux pump;

(b) from the first set of candidate sequences, selecting one or more
sequences that include a sequence encoding one or more potential
transmembrane domains to generate a second set of candidate sequences;

(c) in a bacterial cell, mutating a gene corresponding to one of the
second set of candidate sequences; and

(d) determining whether the bacterial cell exhibits increased
sensitivity to an antibacterial agent, wherein the candidate sequence encodes an
MDR efflux pump if the bacterial cell exhibits increased sensitivity to the
antibacterial agent.

2. A method of determining whether a polypeptide sequence
functions as an MDR efflux pump, said method comprising the steps of:

(a) searching a database of polypeptide sequences for sequences
having high identity to a polypeptide sequence known to function as an MDR
efflux pump to generate a first set of one or more candidate polypeptides;

(b) from the first set of candidates, selecting one or more that have
potential transmembrane domains to generate a second set of candidate
polypeptides;

(c) in a bacterial cell, mutating a gene encoding one of the second set
of candidate polypeptides; and

(d) determining whether the bacterial cell exhibits decreased
sensitivity to antibacterial agents, wherein the candidate polypeptide functions
as an MDR efflux pump if the cell exhibits decreased sensitivity to an
antibacterial agent.

3. A method for deleting a desired region of DNA in a bacterial cell,
said method comprising the steps of:

(a) transforming bacterial cells with a vector comprising (i) a first
region of at least 30 nucleotides substantially identical to a first region of
chromosomal DNA in the bacterial cells; (ii) a second region of at least 30
nucleotides substantially identical to a second region of chromosomal DNA in
the bacterial cells; and (iii) a third region encoding a polypeptide that provides
resistance to a selection agent, wherein the first and second regions of
chromosomal DNA are on opposite sides of the region of DNA to be deleted;

(b) selecting for bacterial cells in which the vector of step (a) has
integrated by a first crossover event into the chromosomal DNA by adding the
selection agent to the culture medium; and

(c) culturing the cells selected for in step (b) in the absence of a
selection agent to allow for a second crossover event to occur in at least a
subset of the cells, wherein the second crossover event results in the loss of (i)
the desired region of DNA and (ii) the region encoding the polypeptide that
provides resistance to the selection agent in that subset of cells.

4. The method of claim 3, wherein steps (a)-(c) are repeated one or
more times to delete additional regions of DNA in the same bacterial cell from
which the first region was deleted.

5. A method for deleting two regions of DNA in a bacterial cell,
said method comprising the steps of:

(a) transforming bacterial cells with a vector comprising (i) a first
region of at least 30 nucleotides substantially identical to a first region of
chromosomal DNA in the bacterial cells; (ii) a second region of at least 30
nucleotides substantially identical to a second region of chromosomal DNA in
the bacterial cells; and (iii) a third region encoding a polypeptide that provides
resistance to a selection agent, wherein the first and second regions of
chromosomal DNA are on opposite sides of a first region of DNA to be
deleted;

(b) selecting for bacterial cells in which the vector of step (a) has
integrated by a first crossover event into the chromosomal DNA by adding the
selection agent to the culture medium;

(c) culturing the cells selected for in step (b) in the absence of a
selection agent to allow for a second crossover event to occur in at least a
subset of the cells, wherein the second crossover event results in the loss of (i)
the first region of DNA and (ii) the region encoding the polypeptide that
provides resistance to the selection agent in that subset of cells;

(d) transforming bacterial cells resulting from step (c) with a vector
comprising (i) a first region of at least 30 nucleotides substantially identical to
a first region of chromosomal DNA in the bacterial cells; (ii) a second region
of at least 30 nucleotides substantially identical to a second region of
chromosomal DNA in the bacterial cells; and (iii) a third region encoding a
polypeptide that provides resistance to a selection agent, wherein the first and
second regions of chromosomal DNA are on opposite sides of a second region
of DNA to be deleted;

(e) selecting for bacterial cells in which the vector of step (d) has
integrated by a first crossover event into the chromosomal DNA by adding the
selection agent to the culture medium;

(f) culturing the cells selected for in step (e) in the absence of a
selection agent to allow for a second crossover event to occur in at least a
subset of the cells, wherein the second crossover event results in the loss of (i)
the second region of DNA and (ii) the region encoding the polypeptide that
provides resistance to the selection agent in that subset of cells.

6. The method of claim 5, wherein said bacterial cell is selected
from the group consisting of Staphylococcus aureus, Streptococcus pyogenes,
Bacillus anthracis, Clostridium tetani, Clostridium botulinum, Vibrio
cholerae, Helicobacter pylori, Salmonella typhimurium, Shigella dysenteriae,
Bordetella pertussis, Yersinia pestis, Pseudomonas aeruginosa, Neisseria
gonorrhoeae, Mycobacterium tuberculosis, Corynebacterium diphtheriae,
Borrelia burgdorferi, Treponema pallidum, Enterococcus faecalis,
Enterococcus faecium, and Streptococcus pneumoniae.

7. A method for determining whether a test substance inhibits the
growth or metabolism of cells of a strain of Enterococcus faecalis having a
disruptive mutation in a gene encoding a protein selected from the group
consisting of Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID
NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO:
12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18),
Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24),
Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30),
Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36),
Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42),
Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48),
NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3
(SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ
ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ
ID NO: 68), said method comprising the steps of:

(a) contacting the cells with a test substance; and

(b) determining whether the growth or metabolism of the cells is
inhibited, wherein an inhibition of growth or metabolism, relative to the
growth or metabolism of cells of the strain of bacteria having the mutation but
not contacted with the test substance, identifies the test substance as one
having the potential use as an antibacterial agent.

8. The method of claim 7, wherein the bacterial strain has disruptive
mutations in genes encoding at least two of the proteins selected from the
group consisting of Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ
ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID
NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID
NO: 18), Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID
NO: 24), Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID
NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID
NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID
NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID
NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO:
54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60),
Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and
Smr1 (SEQ ID NO: 68).

9. A substantially pure nucleic acid molecule consisting essentially
of Enterococcus faecalis Abc1 (SEQ ID NO: 1), Abc2 (SEQ ID NO: 3), Abc3
(SEQ ID NO: 5), Abc4 (SEQ ID NO: 7), Abc5 (SEQ ID NO: 9), Abc6 (SEQ ID
NO: 11), Abc7 (SEQ ID NO: 13), Abc8 (SEQ ID NO: 15), Abc9 (SEQ ID NO:
17), Abc10 (SEQ ID NO: 19), Abc11 (SEQ ID NO: 21), Abc12 (SEQ ID NO:
23), Abc13 (SEQ ID NO: 25), Abc14 (SEQ ID NO: 27), Abc15 (SEQ ID NO:
29), Abc16 (SEQ ID NO: 31), Abc17 (SEQ ID NO: 33), Abc18 (SEQ ID NO:
35), Abc19 (SEQ ID NO: 37), Abc20 (SEQ ID NO: 39), Abc21 (SEQ ID NO:
41), Abc22 (SEQ ID NO: 43), Abc23 (SEQ ID NO: 45), Bmr (SEQ ID NO:
47), NorA (SEQ ID NO: 49), Mf1 (SEQ ID NO: 51), Mf2 (SEQ ID NO: 53),
Mf3 (SEQ ID NO: 55), Mf4 (SEQ ID NO: 57), Mf5 (SEQ ID NO: 59), Mf6
(SEQ ID NO: 61), Mf7 (SEQ ID NO: 63), Mate1 (SEQ ID NO: 65), or Smr1
(SEQ ID NO: 67).

10. A substantially pure polypeptide comprising a sequence selected
from the group consisting of Enterococcus faecalis Abc1 (SEQ ID NO: 2),
Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5
(SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8
(SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID NO: 20), Abc11
(SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID NO: 26), Abc14
(SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32), Abc17
(SEQ ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20
(SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23
(SEQ ID NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ
ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID
NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO:
64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID NO: 68).

11. An Enterococcus faecalis bacterial cell having a disruptive
mutation in a gene encoding a protein selected from the group consisting of
Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4
(SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7
(SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10
(SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13
(SEQ ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16
(SEQ ID NO: 32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19
(SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22
(SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), NorA
(SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ
ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID
NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID
NO: 68).

12. The bacterial cell of claim 11, wherein the bacterial strain has
disruptive mutations in genes encoding at least two of the proteins selected
from the group consisting of Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4),
Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6
(SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9
(SEQ ID NO: 18), Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12
(SEQ ID NO: 24), Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15
(SEQ ID NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34), Abc18
(SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21
(SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr
(SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ
ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID
NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO:
66), and Smr1 (SEQ ID NO: 68).

13. A method for determining whether a test substance includes a
compound that potentially blocks efflux of antibacterial agents from a cell, said
method comprising the steps of:

(a) providing a polypeptide selected from the group consisting of
Enterococcus faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3
(SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ
ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID
NO: 18), Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID
NO: 24), Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID
NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID
NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID
NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID
NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO:
54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60),
Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and
Smr1 (SEQ ID NO: 68);

(b) contacting the polypeptide with the test substance; and

(c) determining whether a compound in the test substance binds to
the polypeptide, wherein binding of the compound to the polypeptide identifies
it as a compound that potentially blocks efflux of antibacterial agents from a
cell.

14. The method of claim 13, wherein said method is performed in
vitro.

15. The method of claim 13, wherein said method is performed in
vivo.

16. The method of claim 13, wherein said polypeptide is expressed
in a strain other than a strain of Enterococcus faecalis.

17. The method of claim 13, wherein the strain is a strain of
Bacillus subtilis or Lactococcus lactis.

18. A method for determining whether a test substance includes a 
compound that reduces efflux of antibacterial agents from a cell, said method 
comprising the steps of: 

(a) providing a cell expressing a gene encoding an MDR efflux
pump selected from Enterococcus faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ
ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO:
10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16),
Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22),
Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID NO: 28),
Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34),
Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40),
Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46),
Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2
(SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ
ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID
NO: 66), and Smr1 (SEQ ID NO: 68);

(b) contacting the cell with the test substance; and 

(c) measuring expression of the gene encoding the MDR efflux
pump in the cell, wherein decreased expression, relative to a cell not contacted
with the test substance, identifies the test substance as containing a compound
that reduces efflux of antibacterial agents from a cell.

19. The method of claim 18, wherein expression of the gene
encoding the MDR efflux pump is determined by measuring protein levels of 
the MDR efflux pump. 

20. The method of claim 18, wherein expression of the gene encoding
the MDR efflux pump is determined by measuring levels of RNA encoding the
MDR efflux pump.

21. The method of claim 18, wherein said MDR efflux pump is 
expressed in a strain other than a strain of E. faecalis. 

22. The method of claim 21, wherein the strain is a strain of B.
subtilis or L. lactis.
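
By way of illustration only, the comparison recited in claims 18 to 20 (expression in a contacted cell relative to a cell not contacted with the test substance) could be computed as sketched below. This is a minimal sketch under stated assumptions and forms no part of the claims: the function names, the use of replicate means, and the 0.5 cutoff are hypothetical choices introduced here, and the input values may be any protein-level or RNA-level signal measured on a common scale.

```python
from statistics import mean


def relative_expression(treated, control):
    """Mean signal of contacted cells divided by mean signal of uncontacted cells.

    `treated` and `control` are replicate measurements (protein or RNA level)
    of the gene encoding the MDR efflux pump, on the same scale.
    """
    control_mean = mean(control)
    if control_mean <= 0:
        raise ValueError("control signal must be positive")
    return mean(treated) / control_mean


def reduces_expression(treated, control, cutoff=0.5):
    """True when expression falls below `cutoff` times the control level.

    The cutoff is an assumption of this sketch; the claims only require
    decreased expression relative to the uncontacted cell.
    """
    return relative_expression(treated, control) < cutoff
```

For example, replicate signals of 1.0 and 1.0 against a control of 4.0 and 4.0 give a ratio of 0.25, which this sketch would report as decreased expression.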

23. A method for increasing the sensitivity of an E. faecalis cell to
an antibacterial agent, said method comprising the step of contacting the cell 
with a compound that blocks efflux of the antibacterial agent from the cell by 
binding to an MDR efflux pump selected from the group consisting of E.
faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6),
Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7
(SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10
(SEQ ID NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13
(SEQ ID NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16
(SEQ ID NO: 32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19
(SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22
(SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), NorA
(SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ
ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID
NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID
NO: 68).

24. A method for increasing the sensitivity of an E. faecalis cell to
an antibacterial agent, said method comprising the step of contacting the cell 
with a compound that blocks efflux of the antibacterial agent from the cell by 
decreasing the expression of a gene encoding an MDR efflux pump selected
from the group consisting of E. faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ ID
NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO:
10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16),
Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22),
Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID NO: 28),
Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34),
Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40),
Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46),
Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2
(SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ
ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID
NO: 66), and Smr1 (SEQ ID NO: 68).

25. A method for determining whether a test substance functions as
an inhibitor of an MDR efflux pump, said method comprising the steps of: 

(a) expressing a nucleic acid molecule encoding an MDR efflux
pump derived from a first bacterial strain in a second bacterial strain to provide
increased resistance to an antibacterial agent relative to the second bacterial
strain not expressing the MDR efflux pump;

(b) contacting the second bacterial strain with an amount of the 
antibacterial agent that inhibits growth of a control strain but does not 
substantially inhibit growth of the second bacterial strain; 

(c) contacting the second bacterial strain with a test substance; and

(d) measuring growth of the second bacterial strain, wherein decreased
growth of the bacterial strain identifies the test substance as one that includes
an inhibitor of an MDR efflux pump.

26. The method of claim 25, wherein the second bacterial strain is a
strain of Bacillus subtilis or Lactococcus lactis.
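
By way of illustration only, the growth comparison of step (d) of claim 25 could be scored as sketched below. This is a minimal sketch under stated assumptions and forms no part of the claims: growth is assumed to be read as optical density, the function names and the 50% and 20% cutoffs are hypothetical, and the optional check that the test substance is not itself growth-inhibitory is an extra control added here, not a step recited above.

```python
def percent_inhibition(od_reference, od_sample):
    """Percent reduction in growth of a sample culture versus a reference culture."""
    if od_reference <= 0:
        raise ValueError("reference growth must be positive")
    return 100.0 * (1.0 - od_sample / od_reference)


def is_candidate_pump_inhibitor(od_drug_only, od_drug_plus_test,
                                od_no_additions=None, od_test_only=None,
                                hit_cutoff=50.0, toxicity_cutoff=20.0):
    """Flag a test substance that restores sensitivity to the antibacterial agent.

    od_drug_only      -- pump-expressing strain with the antibacterial agent alone
    od_drug_plus_test -- same strain with the agent plus the test substance
    od_no_additions / od_test_only -- optional controls (an assumption of this
        sketch) used to exclude substances that inhibit growth on their own
    """
    if percent_inhibition(od_drug_only, od_drug_plus_test) < hit_cutoff:
        return False
    if od_no_additions is not None and od_test_only is not None:
        if percent_inhibition(od_no_additions, od_test_only) >= toxicity_cutoff:
            return False
    return True
```

A culture that drops from an optical density of 0.8 to 0.2 when the test substance is added shows 75% inhibition and would be flagged, unless the optional controls show that the substance alone suppresses growth.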






WO 01/79257 



PCT/US01/12230 



27. A method for determining whether a test substance includes a 
compound that reduces efflux of antibacterial agents from a cell, said method 
comprising the steps of: 

(a) providing a non-bacterial cell expressing a nucleic acid molecule
encoding an MDR efflux pump selected from Enterococcus faecalis Abc1
(SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ
ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID
NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID
NO: 20), Abc11 (SEQ ID NO: 22), Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID
NO: 26), Abc14 (SEQ ID NO: 28), Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID
NO: 32), Abc17 (SEQ ID NO: 34), Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID
NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID
NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ID
NO: 50), Mf1 (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ ID NO:
56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID NO: 62),
Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID NO: 66), and Smr1 (SEQ ID NO: 68);

(b) contacting the cell with the test substance; 

(c) contacting the cell with a compound that is capable of (i)
entering the cell and (ii) being transported by the MDR efflux pump; and

(d) measuring efflux of the compound from the cell, wherein
decreased efflux, relative to a cell not contacted with the test substance,
identifies the test substance as containing a compound that reduces efflux of
antibacterial agents from a cell.

28. The method of claim 27, wherein the cell is a eukaryotic cell.

29. The method of claim 28, wherein the cell is a mammalian cell.

30. The method of claim 28, wherein the cell is an insect cell.

31. The method of claim 28, wherein the cell is a yeast cell. 

32. The method of claim 31, wherein the yeast is Saccharomyces 
cerevisiae. 
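
By way of illustration only, the efflux measurement of step (d) of claim 27 could be reduced to a rate comparison as sketched below. This is a minimal sketch under stated assumptions and forms no part of the claims: it assumes the transported compound gives an intracellular signal (for example fluorescence) that falls as the compound is exported, it estimates the rate by an ordinary least-squares slope, and the function names and the 0.5 fraction are hypothetical.

```python
def efflux_rate(times, signal):
    """Least-squares slope of intracellular signal versus time, sign-flipped.

    A positive return value means the signal is falling, i.e. the compound
    is leaving the cell.
    """
    n = len(times)
    if n != len(signal) or n < 2:
        raise ValueError("need at least two paired time points")
    t_mean = sum(times) / n
    s_mean = sum(signal) / n
    denom = sum((t - t_mean) ** 2 for t in times)
    if denom == 0:
        raise ValueError("time points must not all be identical")
    slope = sum((t - t_mean) * (s - s_mean) for t, s in zip(times, signal)) / denom
    return -slope


def efflux_reduced(times, control_signal, treated_signal, max_fraction=0.5):
    """True when contacted cells export at no more than `max_fraction` of the control rate."""
    control = efflux_rate(times, control_signal)
    if control <= 0:
        raise ValueError("control cells show no measurable efflux")
    return efflux_rate(times, treated_signal) <= max_fraction * control
```

The same comparison could be applied to transport across a lipid membrane in a cell-free system, provided the signal is defined so that it falls as the compound is transported.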

33. A method for determining whether a test substance includes a
compound that reduces efflux of antibacterial agents from a cell, said method 
comprising the steps of: 

(a) providing a cell free system consisting of:

(i) a lipid membrane into which is inserted an MDR efflux
pump selected from Enterococcus faecalis Abc1 (SEQ ID NO: 2), Abc2 (SEQ
ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10),
Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16),
Abc9 (SEQ ID NO: 18), Abc10 (SEQ ID NO: 20), Abc11 (SEQ ID NO: 22),
Abc12 (SEQ ID NO: 24), Abc13 (SEQ ID NO: 26), Abc14 (SEQ ID NO: 28),
Abc15 (SEQ ID NO: 30), Abc16 (SEQ ID NO: 32), Abc17 (SEQ ID NO: 34),
Abc18 (SEQ ID NO: 36), Abc19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40),
Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46),
Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf1 (SEQ ID NO: 52), Mf2
(SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ
ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Mate1 (SEQ ID
NO: 66), and Smr1 (SEQ ID NO: 68);

(ii) the test substance;

(iii) a compound that is capable of being transported by the
MDR efflux pump; and

(b) measuring transport of the compound across the lipid membrane, 
wherein decreased transport, relative to transport in a cell-free system not 
comprising the test substance, identifies the test substance as containing a 
compound that reduces efflux of antibacterial agents from a cell. 

34. The method of claim 33, wherein said lipid membrane is in the
form of a liposome.
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GCT TCT AAT CTC GTG GTA ATT TTT ACG CGA GCT GCC GOT TCC GCA AAC CGT GTC 

ASNLVVIFTRAAASANRV
AAT GAG GTT TTA ACT GTA GAA AGC CAG TTA ACA GAT ACA CCA GAA TCA GCA AAA 

NEVLTVESQLTDTPESAK 
ACA TCT CCA CAA TTT GGT GAT ATT ACC TTT GAC CAT GTA GAT TTT CGC TAT GAA 
TSPQFGDITFDHVDFRYE
CCA GAG GCC GGT TTG GCT TTG GAA AAT ATC AAT TTT ACG ATT CCT AAA GGC TCA 

PEAGLALENINFTIPKGS 
ATT CTC GGC ATC ACA GGA CCT ACC GGC AGC GGG AAA AGT ACC TTA ACC CAA CTC 

ILGITGPTGSGKSTLTQL
ATT CCC AGA TTT TAT GAT GTG AGT GCA GGA AAC CTT TTT ATT AAT GGT G3A AAT 

IPRFYDVSAGNLFINGVN
GTG CGC GAT TGG CCG CTC TTT ACT TTA CGC CAA CAA GTT GCA AGT GTT CCA CAA 

VRDWP LFTLRQQVASVPQ 
ACT GCC GTC TTA TTT ACA GGG ACT ATT CGA GAA AAC TTA CAA TGG GGC AAA CCA 

T A V L FT GT I RENL QW GKP 
AAT GCA ACT GAT GAA GAC TGT TGG GAA GCA TTA GCT ATC GCC CAA TGT AAA GAA 

NATDEDCWEALAIAQCKE
TTT GTT GAA CAA TTA GAC CAA GGG TTA GAC ACG CCT GTT AAC GAA GGC GGA AAA 

FVEQLDQGLDTPVNEGGK
AAC TTT TCT GGT GGA CAA AGA CAA CGG TTG ACC ATT GCC CGT GCC TTG ATT AGG 

NFSGGQRQRLTIARALIR
AAA CCG CAT TTA CTT ATT TTA GAT GAT TCC TTG AGT GCC CTC GAC TAT CAA ACA 

KPHLLILDDSLSALDYQT

GAT TTG AAT CTG CGT CGT GCT TTA CAA AAA GAA CGA GCA GAG ACA ACC GTT ATT 
DLNLRRALQKERAETTVI 

TTA ATT TCA CAA CGC GTG AGT TCG ATT GCA ACG GCG AAT CAA ATT TTA GTC CTA 
LISQRVSSIATANQILVL 

GAT AGT GGA AAA GTC GCT GGC CTT GGC ACC CAC GAA GAA TTA CTT ACT TCT TCT 
DSGKVAGLGTHEELLTSS

AAA GAG TAT CAA GAA ATC GTT GCG TCA CAG GAG GAG GAT ACC CAT GCA AAC TAA 3'
KEYQEIVASQEEDTHAN*



Fig. 4B 
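
For orientation, each amino acid line in the sequence figures is the translation of the codon line printed above it under the standard genetic code. The following minimal sketch shows that correspondence; it forms no part of the disclosure, the function name is hypothetical, and the choice to report any unreadable codon as "X" is an assumption introduced here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(codon_line):
    """Translate a whitespace-separated codon line into one-letter amino acids.

    Strand labels (5' and 3') are skipped. Any token that is not a valid
    codon is reported as "X" rather than guessed.
    """
    residues = []
    for token in codon_line.upper().split():
        if token in ("5'", "3'"):
            continue
        residues.append(CODON_TABLE.get(token, "X"))
    return "".join(residues)


print(translate("AAA GAG TAT CAA GAA ATC GTT GCG TCA CAG GAG GAG "
                "GAT ACC CAT GCA AAC TAA 3'"))
```

A codon containing a character other than A, C, G, or T is left as "X", so damaged positions remain visible instead of being silently filled in.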




5' ATG AAA CGA ATC GGT CGC TAT ATT AAA CCG TAC CGA GTG ACG TTT TAT TTA GTT

MKRIGRYIKPYRVTFYLV
ATT TTA TTT ACA ATA TTA ACC GTT GCC TTT AAT GCA GCG TTG CCT TAT TTG ACT 

ILFTILTV AFNAALPYLT 
GGA TTA CCG ACG ACA GAA ATT AGC CGT AAT ATT GCG GCC GGC GAA TCC ATT AAT 

GLPTTEISR NIAAGESIN 
TTT GAT TAT GTA ATC CAA TGT TTA ATT TGG ATT TTA GTT GTG GGA ACA GGT TAT 

FDYVI QCLIWILVVGTGY 
TGT GTG GCA CAA TTT TTG TCA GGC TTT TTA ATG ACG AAT GTC GTT CAA CAA TCC 

CVAQFLSGFLMTNVVQQS 
ATG CGC GAT TTG CGT CGC GAT ATT GAA GAA AAA ATC AAT CGT TTG CCA GTT TCT 

MRDLRRDIEEKINRLPVS
TAT TTT GAT AAG AAC CAA CAA GGA AAT ATT TTG TCA CGG GTG ACG AAC GAT GTG

YFDKNQQGNILSRVTNDV
GAT GCT GTC AGC AAT GCG ATG CAA CAA AGT TTT ATC AAT ATT GTT TCA GCA GTC 

DAVS NAMQQSFIN IVSA V 
TTA GGT ATT GTG ATG GCG GTA GTG ATG ATG TTC TTA ATC AAT CCG CTG ATG GCG 

L G I VMAVVMMFL I NPLMA 
ATT TTT TCA GTG ATT ATG ATT CCG TTG TCT CTG ATT ATT TCC AGA ACA ATT GTT 

IFSVIMIPLSLIISRTIV 
AAA ATC TCC CAG AAA TAT TTC CAA GGA ATG CAA AAT TCT TTA GGA GAC TTA AAT 

KISQKYFQGMQNSLGDLN 
GGT TAT GTC CAA GAA AAT ATG ACT GGG TTC AGT GTC TTA AAA CTA TAT GGT CGG 

GYVQENMTGFSVLKLYGR
GAA AAA GAA ACC CTT GAA GGC TTT AAA CAA GTC AAT CAT CGT TTA AAT GGT TTT 

EKETLEGFKQVNHRLNGF
GGT TTC AAA GCA TCC TTT ATC TCA GGA TTA ATG TTG CCA TTG GTT CAG ATG ACC 

GFKA SFI SGLMLP LVQMT 
GCT TAT GGG ACC TAT ATC GGG GTA GCT GTC CTT GGT AGT TAC TAT GTG GTT GCT 

AYG TYI GVAVLG S YYVVA 
GGT GTG ATC GTA GTG GGG CAA TTA CAA GCG TTT ATT CAA TAT ATT TGG CAA ATT 

GVIVVGQLQAFIQYIWQI 
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AGC CAA CCA ATG GGG AAT ATT ACG CAG TTG TCT GCA GCT TTA CAA AGC GCT TCA 

SQPMGNITQLSAALQSAS
GCT TCG ACC ATG CGG ATT TTT GAA ATC CTA GAT GAA CCA GAA GAA GAA CTT AAC 

ASTMRIFEILD EPE EELN 
GAA CAA GAT GTT CCT TTG CCA GAA CCT ATT TTA GGC TCT GTT GAA TTT GAA AAT 

EQDVPLPEPIL GSVEFEN 
GTC AGC TTT AGT TAT GAC CCA GAA AAA CCG TTA ATT CGT AAT TTG AAC TTT AAA 

VSFSYDPEKPLI RNLNF K 
GTT GAT GCG GGC CAA ATG GTT GC3 ATT GTG GGA CCA ACT GGC GCT GGG AAA ACA 

VDAGQMVAIVGPTGAGKT 
ACC TTA ATC AAC TTA CTG ATG CGT TTT TAT GAT »3TA ACA GAA GGC GCC ATT AAA 

T It I N L L M R F Y D 7 T E G A I K 
ATT GAT GGC ATT GAC ACG AAA AAA ATG AAC CGT AGT GAT GTC CGA TCT GTA TTT 

IDGIDTKKMNRSDVRSVF 
GGA ATG GTA TTG CAA GAT GCT TGG TTG TAT AAA GGT ACC ATT GCA GAT AAC ATT

GMVLQDAW LYKGTIADNI 
CGT TTT GGG AAG TTA GAT GCC ACG GAT TAT GAA GTT GTC GAT GCA GCG AAA ACG 

RFGKLDATDYEVVDAAKT 
GCC AAT GTG GAT CAC TTC ATT CGG ACA ATG CCA GAC GGG TAT GAA ATG GAA ATC 

ANV D H F I RT MP D G Y E M E I 
AAT TCT GAG GGA GAT AAC GTT TCC CTT GGT CAA AAA CAA TTG TTG ACC ATT GCC 

NSEGDNVSLGQKQLLTIA
CGA GCG GTA ATT TCT GAT CCG AAA ATT TTG ATT TTA GAT GAG GCG ACT AGT TCA 

RAVISDPKILILDEATSS
GTC GAT ACA CGC TTG GAA GCC TTA ATT CAA AAA GCA ATG GAT CGT GTT ATG GAA 

VDTRLEALIQKAMDRVME
GGA CGA ACG AGT TTC GTT ATT GCC CAC CGC CTA TCA ACT ATT CGT GAA GCT GAT 
GRTSFVIAHRLSTIREAD

TTA ATT CTT GTT ATG AAA CAA GGA GAA ATC ATT GAA AAA GGT ACG CAT CAT GAG
LILVMKQGEIIEKGTHHE

TTG CTG GAA CAA GGT GGC TTC TAT GAA AAA CTA TAC AAT AGT CAA TTT GCT GAA 
LLEQGGFYEKLYNSQFAE 
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ATT GTC GCA GGT AGC CGT ATT TTA AAA ATT ATG GAT ACG GAA GAA TTA ACG CCG 

IVAGSRILKIMDTEELTP
CAA CAA TCA GTA GGT GCG AAT GGG GAA ATT ATT CGC GGG AAA ATT GAA TTT CGC 

QQSVGANGEIIRGKIEFR
AAT GTT ACT TTT TCT TAT GAC GGC AAA AAC GAA GTA TTG AAA AAT ATT AGC TTT 

NVTFSYDGKNEVLKNI SF 
GTC GCA AAT CCC GGG GAA ACC GTC GCT TTA GTC GGT CAT ACA GGA AGT GGT AAA 

VANPGETVALVGHTGSGK
AGT TCA ATT ATT AAT GTA CTC ATG CST TTC TAT GAA TTT TAT GAA GGG CAG ATT 

S S I I N V L M ii F Y E F Y E G Q I 
TTA ATT GAT GAT CGT GAC ATT CGT GAT TTT CCC ATG ACA GAA TTA CGG GAA AAG 

LIDDRDIRDFPMTELREK
ATG GGC TTA GTA TTA CAG GAC GCT T.- »: ATG TTT TAT GGA GAT ATT GCT GGA AAT 

MGLVLQDAFMFYGDIAGN
ATC CGT TTA CTG AAT CCC AAT ATT ACA GAT GAG CAA ATT AAA CAG GCT GCG GAA 

IRLLNPNITDEQIKQAAE
TTT GTT CAG GCA GAT AAA TTT ATT CAC ACG TTA CCC AAT ACC TAT CAT GCG AAA 

FVQADKFIHTLPNTYHAK 
GTC ATT GAA CGA GGA GCG AGT TAT TCT AGC GGG CAA CGT CAA TTA ATT TCG TTT 

VIERGASYS SGQRQLISF 

GCA CGA ACG ATT GTG ACT GAT CCC AAA ATT TTA GTT TTA GAT GAA GCA ACC GCT 
ARTIVTDPKILVLDEATA 

AAT ATT GAT ACA GAA ACA GAA GGA TTG ATT CAA GAA GGA TTA GCC AAA ATG CGT 
NIDTETEGLIQEGLAKMR

CAA GGG AGA ACG ACA ATT GCG ATT GCG CAT CGC CTT TCG ACG ATT CGT GAC GCA 
QGRTTIAIAHRLSTIRDA

AAC TTG ATT TTA GTT TTA GAC AAA GGG CAG ATT GTA GAA CGA GGA ACG CAC GAA 
NLILVLDKGQIVERGTHE

ACG TTA CTG GCA GAA GGT GGC CTT TAT GCG GAT ATG TAT CAG TTA CAA AGT ACG 
TLLAEGGLYADMYQLQST

GAA GTT TAA 3'
EV*
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GGT TTA GTC GTT ATG TTT GTC CAA TAT TCT CAA CAA TAT TAT CAA CCA TTA ATG 

GLVVMFVQYSQQYYQPLM
CAA ATT TCT TCA GGT TAT AGC ATG ATT CAA TTA GCT GTT ACA GGC GCT CGT CGT 

QISSGYSMIQLAVTGARR 
TTA AAT GAG ATG TTT GAT GAA CCA GAT GAA ATT CGT CCC GAA AAT GGC GAA AAA 

LNEMF D EPDEIRPENGEK 
TTA GAA GAG ATT AAC AAA GCT GTG GCT TTA AAT CAT GTT GTT TTT GGC TAT AAT 

LEE INKAVALNHVVFGYN 
CCA GAA ACG CCT GTG TTG AAA GAT GTT TCT ATT CAT GTG GAT AAA GGT GAA ATG 

PETPVLKDVS IH VDKGEM 
GTC GCT TTA GTT GGA CCG ACA GGC TCA GGA AAA ACA ACG ATT ATG AAT TTA ATG

VALVGPTGSGKTTIMNLM
AAT CGC TTT TAT GAT GTG AAT GAA GGC GCC GTG ACG TTT GAT GGT GTG GAC ATT 
NRFYDVNEGAVTFDGVDI
CGT GAA ATG GAT TA GAT AGC TTG CGT TCA CAT GTA GGC ATT GTT TTG CAA GAG 

REMDLDSLRSHVGIVLQE
TCC GTT CTA TTT TCA GGA ACA ATT CGT GAG AAT ATT GCT TTT GGT AAG CCA GAG 

SVLFSGTIRENIAFGKPE
GCA ACG GAT GAA GAA ATT GTT CAA GCA GCG AAA CAA GCG AAT ATC CAT GAA TTT 

ATDE E I VQAAKQ AN I H EF 
ATT GTA AAT CTT GAG CAA GGC TAC GAT ACT GAG ATT ACG GAA GAG AAT AAC CTT 

IVNLEQGYDTEITEENNL
TTC AGT ACG GGG CAA AAG CAA TTA GTA AGT ATT GCA CGA ACG ATT ATT ACC AAT 

FSTGQKQLVS IARTI ITN 
CCA GAA TTA TTA ATT TTG GAT GAA GCA ACA AGT AAT GTT GAT ACA GTA ACA GAA 

PELLILDEATSNVDTVTE
GCT AAA ATC CAA AAA GCG ATG GAC GAA GCA ATT AAA GGG CGC ACC AGT TTT GTG 

A K I Q K A M D E A I K G R T S FV 
ATT GCC CAT CGT TTG AAA ACG ATT CTA AAT GCG GAT CGG ATT ATT GTT TTA CGA 

I AHRLKT I LNAD R I I VLR 
GAC GGT GAA GTC ATT GAA GAA GGC AAC CAC CAT GAA CTA GTT GAA CAA GAT GGA 

DGEVIEEGNHHELVEQDG
TTC TAT GCG GAA CTT TAT AAA AAT CAA TTT GTT TTT GAA TAA 3'
FYAELYKNQFVFE*
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GCG CCG CAC GCT ATT GAG TTG CCG CCA ATT AAA GGA ACC GTC GAC TTT CAG CAT 

APHAIELPPIKGTVDFQH
GTT TAT TTC CGT TAT GAA GAA GGA AAA AAT ATT TTG ACC GAT GTT ACT TTT CAT 

VYFRYEEGKNILTDVSFH
ATT GAG CCA GGA CAA ACA ATC GCC TTG GTT GGG CCA ACA GGT GCG GCT AAA ACA ... 

IEPGQTIALVGPTGAGKT
ACA ATC ATC AAT TTG TTA AGT CGT TTT TAT GAT GTG AAT GAA GGG GCC GTT AAA. 

TI INLLSRFYDVNEGAVK 
ATT GAT GGT TAT GAT GTT CGC GAT GTG ACA CTC CGT TCG TTA AGA AAA CAA ATG 

IDGYDVRDVTLRSLRKQM
GGG GT?. ATG CTT CAA GAT ACG TTT ATT TTT TCA GGA ACQ ATT ATT GAA AAC ATT > 

GVMLQDTFIFSGTIIENI
CGG TAC GGA AAT TTA GCA GCC ACA GAG GAA GAA GTC ATC CAA GCC GCA AAA ATT 

R Y GNLAATEEEVI QAAKI 
GTT CGT GCG CAC GAC TTT ATC AAG GAT TTA AAA GAT GGC TAT GAA ACA GTT GTG

VRAHDFI KDLKDGYETVV 
GAA GAG CGG GGT AGT ACA CTC TCG GCA GGA CAA CGC CAA TTA ATT TCA TTT GCT 

EERGSTLSAGQRQLISFA
CGT GCT TTA CTG GCA GAT CCC AAA ATT TTA ATT TTA GAC GAA GCA ACC TCC AGT 

RALLADPKI LILDEATSS 
ATT GAT ACA AAA ACA GAA GAA TTG TTA CAA GAA GGA CTA CAA CAA CTT CTG AAA 

IDTKTEELLQEGLQQLLK
GGA CGG ACA TCG TTT ATT ATT GCT CAT CGA CTT TCT ACC ATC AAA AAT AGC GAT 

GRTSFI IAHRLSTIKNSD 
AAA ATT TTC TAT ATT GAT GGT GGT CGT ATT GTG GAA GAA GGA TCT CAT GAT CAA 

KIFYIDGGRIVEEGSHDQ
CTA ATG GCA AAA CAT GCG TTA TAT CAT CAT TTA TAT CAA TCG CAA TAC GAC TTA 

LMAKHALYHHLYQSQYDL 
TTA AAG AGC TAG 3'

LKS*
5' ATG CAA GAT ACT ATA CAA GAT TGT GGT GTT GCA TGC ATA GAA ATG ATA TGT AAA

MQDSIQDCGVACIEMICK
TTC TAT AAT ATT AAT ATA GAT AGA AGA TAT ATC CAA GAA GAA ACT GGC TAT GGA 

FYNINIDRRYIQEETGYG 
ATG ATA GGA ATA TCA TTA AAA GCT ATG GAA AAA TTT TTT TCA AAA GTA GAT GCA 

MIGI SLKAMEKF FSKVDA 
AAC CCA GAA ATA GTA AAT ATA TCT AAA ATT AAT AGG TTA AAC AAA GAA AAT AGG

NPEIVNISKINRLNKENR
GAA ATG ATT AAT AAC TCA CTG CCA GCA ATT GTC TTT CTT GAA GAG GAG GAA ATT 

EMINNSLPAIVFLEEEEI
ATT AAT CAT TTT GTA GTA ATC TGG CAT ATT GGT AAG AAA AGA ATA CTT GTT TCA 

INHFVVIWHIGKKRILVS 
GAT CCT ACA CAT ACT AAA AAA GAA TGG ATA AAT AAT AAA CAT TTT GAA AAA AGA

DPTHTKKEWINNKHFEKR
GCT ATT AGT TAT TTA TTT GTA GAA AAA CCA AAA AAT ATT TTT TTA AAT CGC ACT 

AISYL FVE KPKNI FLNRS 
CCC AAA AAA GTA AGA TTT TAT GGG AAG TTT ATA AAA AGA AAT TTA AAA ATA TTC 

PKKVRFYGKFIKRNLKIF
TCG TTA GTA ATG TTC TTT TCT ATA ATT GTT AGT TTG TTA AGT ACT TAT CTA TCA 

SLVMFFSIIVSLLSTYLS 
TTT AAT TTA GGT AAA AAT ATA AAT ATT CTT GAA AAA AAT ATT CCT TTG CAA GAA 

FNLGKNINILEKNIPLQE
AAA ACT TTT CTT ATT CAA AGT ATA TCT ATA GGT TTT ATT GTA TTA GCT TTA GTA 

KTFLIQSI SIGFIVLALV 
CAA GTA GTA GTT AGC TAT TTC AAA AAT ATT CTA GTT ATA AAA GGA ACA TTT GCT 

QVVVSYFKNILV I KGTFA 
TTT GAA ACA CAA TTA TTC AAA AAG TAT ATA AAT AAA ATG TTA ACT ATT CAT CCA 

FETQLFKKYINKMLTIHP 
AAA TAT TTT CAG TTA AAT AGT TCA GGA GAA TTT GTA TCT AGA ATT TCC GAT ATC 

KYFQ LNS S GEFVS R I S DI 
TTA CAA GTT AGT AAC GAT GTG ATG AAC TTT TTC ATA AGC GTA CTT CTA AAC TTT 

LQVSNDVMNFFISVLLNF 
TGT ATG TAT GTT TTC TCG ATA ATA ACA TTG TTT TAT TTT ACT AAG GTT ATG GCG

CMYVFSIITLFYFSKVMA
ATT TAT CTT GTA GTT ATT TCT GCA GTA GTA ATA TTG TTT TTT GCA TAT ATT TAT 

IYLVVI SAVVI LFFAY IY 
CCA AAA ATC TAT CAT TTT GGG TAC ACA ATA TCT AAA TCC AAT ACT GAT TTT CAG 

PKIYHFGYTISKSNTDFQ 
AAA AAA TTA ATT ACT TTT ATT GAT GGA TTA GAA GAA TTT AGG GCT TTA AAT GCT 

KKLITFIDGLEEFRALNA
GAT AAA TAC TTT AGT GAG AAA GTC TTA GAT AAA ATA AAT AAA TTT ATT ACT AAT 

DKYFSEKVLDKINKFITN
AAT AAA AAA CTA CAA TTT TTT GGT ACA AAT AGT AAT GTT TTT TTA AGT TTA TTC 

NKKLQFFGTNSNVFLSLF 
AAT ACT GTT GGG ACA GTA ATA TTG ATA GCT ATA GGC AGC TTA TTA GTA TTA AAG 

N TVGTVIL IAIG SLLVLK 
AAT AGT ATA AAT TTA GGA AAT TTA ACA GTA TTA TCA TTA TCA GGA ATA GTA 

NSINLGNLTVLISLSGIV
GTA TCA TCC CTA AAT CAA TTT GGG TCG TTT CAA GCG CAA TAT GAA TCG ATG CAG 

VS S LN Q FGS FQ AQY E S MQ 
GTG GCT AGT CAT AGG CTT GAA TCT ATA TTA ATA AAT ATG GAA AAT GAG AAT GTA 

VASHRLESILINMENENV
TGC GGG GAA ATA ATT TTA GAC AAG AAA ATA GAA TCA ATA AGA TGT AAA CGT GTT 

CGEIILDK KIESIRCKR V 
TCC ATA AAA AAA GGA GAT ACT TTA TTA CTC GAC ACT GTT AAC TGT GAA ATC TAT 

SIKKGDTLLLDTVNCEIY
AGA GGG AAA AAT CTT TCA ATA CGA GGT GAA AAT GGT TCA GGT AAA TCT ACA TTA 

RGKNLS IRGENGSGKS TL 
ATT AAG TCT TTA GTA AGA TTA GAT GAT GAC TAT AGG GGA CAG ATT CTA ATT AAT 

IKSLVRLDDDYRGQI LIN 
AAT ATT GAC ATA AAA AAA ATC AAC TTG GAC TGT CTA CGA TCA AAA TTA GTG TTT 

NIDIKKINLDCLRSKLVF
GTA GAG CCC AAT CCT AAA TTT TTA GAA GGT ACA ATT AGA GAT AAT TTA TTA TTG 

VEPNPKFLEGTIRDNLLL







GGA CAT AAA GTT CCG ART ACT ATA TTT AAT AAG TTG ATA AGA GAT TTT GAA ATA 
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5' ATG GAA AAA CAG CCG ACA ACG AAA GAG GTT TGG AAA AAC GAT CAA TGG GTT CGT 

MEKQPTTKDVWKNDQWVR 
CCT TTC TTA AAA CGC TAT AAA AAA ACC TTG TAT TTT GCC TTA CTT TTA GGC TTC 

PFLKRYKKTLYFALLLGF
TTG ACG TTC TTT AGC GCA GGG GCA TTA ATG TTT ACG TCT GGT TAC TTA ATC AGT 

LTFFSAGALMFTSGYLIS 
CGT GCG GCT TCC TTA CCA GAG AAT ATT TTA TTA ATT TAT ATT CCC ATT GTT TT2- 

RAASLPENILLIYIPIVL
ACA AGA GCA TTC GGG ATT GGA CGT CCT GTT TTT CGT TAT GTG GAG CGT CTA ACG 

TRAFGIGRPVFRYVERLT 
AGT CAT AAC TGG GTG TTA AAA ATG ACC TCT GAT CTG CGC TTG AAA CTT TAT AAT 

S H N W V L KMTSDLRLKLYN 
GTA CTA GAA AAA GAT GCG ATT TTC TTT AAG ACA AAA TAT CGC ACT GGG GAT ATC

VLEKDAIFFKTKYRTGDI
TTA GGT TTG CTT TCG GAG GAT ATT AAT CAT ATC CAA AAC TTG TAT TTA CGA ACG 

LGLLSEDINHIQNLYLRT
ATT TTC CCA ACC GTT ATT GCT TGG ATT TTG TAT ATC TTT TTA GTG ATT GCA TTA

IFPTVIAWILYIFLVIAL 
GGT TTC TTT TCT TGG TGG TTT GCT TTA TGC ATG TTA CTG ATG TTA GGA GTG GTT 

GFFSWWFALCM LLMLGVV 
GTT TTT CTA TTA CCA CTT GTT TCA GTT CTA GTG AAT GGT GCG CGT CAA GAA AAA 

VFLLPLVSVLVNGARQEK
CAT AAA TAT GCC AAA AAT GAG TTA TAT CAA ACA CTA ACC GAC AAT ATT CTA GGT 

HKYAKN EL YQTLTDNI LG 
GTG TCT GAT TGG GTA TTT AGT CAA CGA GGC TCT GAA TTC GTT GCT CGT TAT GAA 

VSDWVFSQRGSEFVARYE 
ACA GAT GAA GCA AAC GTT CGT GCA TTA GAT GAA AAA ATG AAG CAA TTT AAC CGT 

TDEANVRALDE KMKQFNR 
GGC CGA GAC TTT GTT TTA CAA CTT CTG TTT GGT GTG ATT GCG ATT GCT GTC TTA 

GRDFVLQLLFGVIAIAVL
GCT TGG ACA AGT GTG CGG TTT CCA GGT AAT CAT GGG GGC GCA GCC AAT TGG ATT 

AWTSVRFPGNHGGAANWI 
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GGT GCC TTT GTT TTA ACC GTT TTT CCG TTA ATT GAT GCC TTT GCT CCG CTA CCT 

GAFVLTVFPLIDAFAPLP
GCT GCA GCA CAG GAA ACA ACG ATT TAT AAG GAT TCC ATT CGT CGT TTT AAT GAG

AAAQETTIYKDSIRRFNE
CTT CCT GAA GGG GAA GAT GAC TCA ACA GAA GCG CCT GTT CAG CCA AAT GGC ACC

LPEGEDDS TEAPVQPNGT 
GGT TTA TCG ATA GAA CAT TTG TCT TTT GCA TAT GAA AAT CAA GCG AAA AAG GTA

GLS I EHLS FAYENQAKKV 
TTA AAT GAC TTA TCG TTA ACA ATT CCT GAA AAA CAA AAA TTA GCG ATT TTA GGG 

LNDLSLTI PEKQKLAILG 
CGC ACT GGT TCC dti AAG ACT ACG TTG GCT TOG CTC ATT CGT GGT GAT TTA CGG 

RS GS GKST LASLI RGDLR 
CCT ACT TCG GGA GAA ATC CTT TTA GGA AAT ATC CCA ACT GAG GCA TTT GGT GAG 

P T S G E ILLGNI PTEAFGE 
ACG ATG ACT GAA TAT ATT GGG GTT ATG CAT CAA GCA CCG TAC CTA TTT CGA ACA

TMTEYIGVMHQAPYLFRT
ACC ATT TTA AAT AAT ATT CGG ATT GGT AGA GAA GAA GCA AGC GAA GCA GAA GTT

TILNNIRIGREEASEAEV
TGG GCG GTT TTA GAA AAA GTT GGC TTA AAA GAA ATG GTT GCC CAA TTA CCA GAA 

WAVLEKVGLKEMVAQLPE
GGT TTA CAA ACA ATG GTG GAT GAA GCA GGT TTG CGT TTT TCT GGT GGC GAA CGA 

GLQTMVDEAGLRFS GGER 
CAT CGC TTG GCG CTA GCG CGG ATT CTT TTG CAA GAT ACA CCG ATT GTG TTA TTA 

HRLALARILLQDTPIVLL
GAC GAA CCA ACT ACT GGT TTA GAT CCA ATT ACA GAA CAG CAA TTG CTG GAA ACC 

DEPTTGLDPITEQQLLET
TTT TTT GAG GCA TTA AAA GAT AAA ACC GTC ATC TGG ATT ACC CAC CAT TTG CAA 

FFEALKDKTVI WI THHLQ 
GGC GTA ACT TTG ATG GAT CAA GTG ATC TTT ATT GAA GAT GGC CAA TTA GAG ATG 

GVTLMDQVIFIEDGQLEM
AGT GGT ACG CCA GAA GAA TTA CTT GCA ACC AAT GCG CAT TAT CAG AAA CTT TAC 

SGTPEELLATN AHYQKLY 
CGA ATT GAT COT GGC ATT AGT TCT TTC GAG GAA TAA 3'
5' ATG AAA GAG TTT ATT AAA GAA AAT AAA TGG ATT GTT CTT GCG ACA ACT TTA ACC

MKEFIKENKWIVLATTLT
ATC TGT TTG CAA ATC GCA GGA ACC CTT GGC GTC CCT AAA TTA GTT GGC AAG TTG 

ICLQIAGTLGVPKLVGKL
ATT GAT GTG GGC ATC GTT AGC GGT GAC CAA CAA GTA ATT AAM ACG ATT GGC ATA 

IDVGIVSGDQQVIXTIGI 
CAA ATG TTT CTT GTG GCA TTC ATT GGA ACA ATT GCC GCA ATT ATT TCT AGC TAT 

QMFLVAFIGTIAAIISSY
TTG TCT GCT TTA GTA GCT GCT AAA TTT GGT TTT CAA GTT AGA GGA TTG TTC TTT 

LSALVAAKFGFQVRGLFF
AAA AAA TTT CAA CAA TTC TCG ATG AAA AAT GTT GAT AAA TTT GGT TCA AAT TCT 

KKFQQFSMKNVDKFGSNS 
TTG CTA ACT AGA ATG ACC AAC GAT GTA GAT AAT GTT CAA ACA ATG ATT GTG CTG

L LTRMTNDVDNVQTMIVL 
TTT TGC CAA TTA ATC TTT CCA GCG CCT ATT ATT AGT TTA TTT GCC TTA GTG ATG

FCQLIFPAPIISLFALVM
ACA TTT TCT TAT TCA GTT TCA CTC GCT TGG GTA ACA TTG GCT TCC ATC GTA TTT 

TFSYSVSLAW VTLASIVF 
TAC TTA GTC GTT GTT TAT TTT TTA ATG AAA AAA GGT ACC CCT TTA TCT TTA AAA 

YLVVVYFLMKKGTPLSLK
ATT CAA CCA AAA ATG GAT CGA ATT ACT ACG ACT TTA CGA GAG TTC TTT ACT GGA 

IQPKMDRITTTLREFFTG
ATT AAT ATG ATT CGT GCG TTC AAT AAT CAA GAT TTT GAA GAG CAG CGA ACC AAT 

I N M I RAFNNQDFEEQRTN 
CAA ACA TTT AAA AAT TAT GCT GAA CGC ATG AGT AAA GTG AAT CAA ATC TTT GCT 

QTFKNYAERMSKVNQIFA
TGG ATT ACT CCC GTT GCC TTT TTA TTA ATG GGC GTT GTG TAC GCC TCT ATT TTG 
WITPVAFLLMGVVYASIL 

TGG TTT GGC GGT AAT TTA GTT GCA GTA GGC ACC CTA CAA ATT GGC ACC GTT ACA 
WFGGNLVAVGTLQIGTVT

GCT GTG ATT GAA TAT ACG CTT CTA ACT TTG GCC TAC TTA ATG ATC GCG GCT ATG 
AVIEYTLLTLAYLMIAAM
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GTA TTA GTT GTC ATT CCA CGG TCC GTT GCT TCC TTG AAT OGC TT6 CAA GAA GTT 

VLVVIPRSVASLNRLQEV
TTG TCA GAA GAA ATT GAA ATT AGC GAT CCT CAT ACT GAG GCA ACC ATT GCT TAT
LSEEIEISDPHTEATIAY
CAT CCT GAG AAA GCC TTG ATT TGC TTC GAT CAC GTC ACG TTT CAA TAC ACA GAA 

HPEKALICFDHVTFQYTE
ACA GCT GAT CCT GTT TTA GAA AAT GTT AGT TTT GTC ATT CCT AAA GGA AAA ACA

TADPVLENVSFVIPKGKT
ACG GCG ATT GTT GGT GCA ACT GGC GCT GGT AAA AGT ACT TTA GTT AAG TTA CTT
TAIVGATGAGKSTLVKLL
TTA CGA ATA AAT GAG GTC ACA GCC GGC ACG ATT AGC TAT TCT GGC ACA GAT ATC

LRINEVTAGTISYSGTDI
CGC TCA TTA TCT CAG CAA ACG ATT CGC CAA GTC ATC AGT TAT GTG CCA CAA AAA 

RSLSQQTIRQVISYVPQK

GCC TTT CTT TTC AGT GGG ACA ATC TTA TCA AAC TTA TTA ATG (r\*«k AAT GCC AAA 
AFLFSGTILSNLLMGNAK

GCA ACT ACA GAA GAA ATA AGA ACG GCA CTA GAA ATT TCA CAA TCT TCT GAA TTT 

ATTEEIRTALEISQSSE F 
ATC GAT TCC TTA CCA CAA GGG ATT GAA AGT TTC GTA GCA CAA GGC GGG TCC AAC
IDSLPQGIESFVAQGGSN
TAT TCT GGT GGT CAA AAA CAA AGA ATG TGT ATT GCA CGA GCC TTA ATC AAA CCG

YSGGQKQRMCIARALIKP

GCA GAC GTT TAT ATT TTC GAT GAT AGC TTT TCC GCA TTA GAT TAC AAA ACT GAT
ADVYIFDDSFSALDYKTD
GCC GCT CTA CGT GCC GCT TTA CAT GCA CAA ATG TCG GAC AAA ACT TTA CTC ATT
AALRAALHAQMSDKTLLI
GTT GCT CAA CGG TTA AGT ACA ATC ATG AAC GCT GAC AAC ATT ATC GTC CTA GAT 
VAQRLSTIMNADNIIVLD
GAA GGA AGG ATT GTT GGT CAA GGC ACC CAC GCT GAT TTA CTT ACC ACT AAT AGC 

EGRIVGQGTHADLLTTNS
TAT TAC CAA GAC TTT GCT AAA TCG CAA GGT ATC TTA CCC AAG TAA 3'

YYQDFAKSQGILPK* 



Fig. 14B 




5' ATG AAA AAT TTT TAT AAA AAA AAA TTT GCT TTA ACT GAT CAG GGC GCA GAA GCT
MKNFYKKKFALTDQGAEA
TTA ACT AAA GCT TCT ATT TCT AGT TTT TTT GTA TAC TGC ATT AAT ATG GTA CCA

LTKASI SSFFVYCINMVP 
GCA TTC ATT ATA ATG ATG CTG ATA GAC GAA TTA GTT TTA GAA AAT GCA AAG CCC 

AFIIMMLIDELVLENAKP 
CGT TGG CTT TAT TTT GCA GTA TCG TTT GTT ACT TTG CTT TTC ATG TAT TGG TTG 

RWLYFAVS FVTL LFMYWL 
TTA GAT AGG GAG TAT GAA AAC TTA TAT AAT AGT ACT TAT AAA GAA AGT GCG CAT 

LDREYENLYNSTYKESAH
TTA AGA GTG CAA ATT GCA GAC GAT TTG TCA AAT TTA CCA TTA TCC TAT TTT TCA 

LRVQIADDLSNLPLSYFS
AAA CAT AAT TTA TCA GAT TTA TCT CAA ACT ATC ATG TCT GAC GTT GAA GGT ATT 

KHNLSDLSQTIMSDVEGI 
GAG CAT GCG ATG AGT CAT GCA ATA CCT AAA TCC GGC GGT ATG GCT CTG TTT TTC 

EHAMSHAIPKSGGMALFF
CCT TTT ATT TCA GTG ATG CTT TTG GTT GGT AAT GTC AAA ATG GGA TTA GCT GTT 

PFISVMLLVGNVKMGLAV
ATT TTG CCA ACG TTA TTT AGT TTT GTC TTA ATC TTG TTA TCA AAG AAA TCC CAA 

ILPTLFSFVLILLSKKSQ
ACG AAA GCC AAT ACT AAA TAT TAC GAT ACT TTG AGA GAA AAC TCG GAA GAA TTT

TKANTKYYDTLRENSEEF
CAA GAA ACT ATT GAA CTG CAG CAA GAG ATT AAT AGC TTT AAT CTA TCT AAA AAA 

QETIELQQEINSFNLSKK
GTT CAA GAC AGA CTT TTC AAA AAA ATG GAA GAG AGT GAA AGG ATT CAT TTA AAG 

VQDRLFKKMEESERIHLK
GTA GAA TTA AGT ACT TTT TCA GTC ATG GCC TTA TCC TCT ATT TTC TCA TAT GTT 

VELSTFSVMALSSIFSYV
AGT TTA GCA GTA GTC ATT CTA GTA GGT GTT CAC TTA CTG TTA ACG GGT GAA GTG 

SLAVVILVGVHLLLTGEV
ACT ATA CTC TAC GTC GTT GGT TAC TTA CTA GCC GCA ATA AAA ATA AAG GAT TCC 
TILYVVGYLLAAIKIKDS
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TTC GAT TCT ATG AAA GAA GCA GTG CTT GAA ATA TTT TAT TTA GCT CCT AAA ATA 

FDSMKEAVLEIFYLAPKI
CAG CGA ATT AGA GCT ATG AAG GAA ACC AGT ATC CAA GAG GGA TCA GAT AGT CCT 

QRIRAMKETSIQEGSDSP
CTG AAA TCT TTT GAC GTT GAG TTG CGA GAT GTC TCT TTC TCA TAT GAT AAC AAT 

LKS FDVELRDVSFSYDNN 
ACC CCA ATT TTA GAT CAT ATC TCT TTT ACG GCT AAA CAA GGA GAA GTG ACT GCG 

TPI LDH I SFTAKQGEVTA 
TTA GTA GGT GCT AGT GGT TCC GGG AAA ACT AGT ATC TTG AAA TTA GTG TCT AGA 

LVGASGSGKTSILKLVSR
CTG TAT GAT TAT GAT GAA GGC TGT ATT TTA ATT GAT GGG TAT GAT ATT AAA CGT 

LYDYDEGCILIDGYDI KR 
GTA TCT CCA GCT TCT TTG TTT AGC AAA ATC GCT ATT GTT TTT CAA GAG GTA ACT 

VSPASLFSKIAIVFQEVT
TTG TTT AAT ACA AGT ATT TTG GAA AAT ATA CGT ATT GGA AAT AGT CAA GCG TCA 

LFNTSI L ENIRIGNSQ AS 
GAT GAG GAA GTG AAA AAA GCA GCT CGT TTA GCT AAC TGC GAA GAC TTT ATT GAA 

DEEVKKAARLANCEDFIE
AAA CTA CCT GAC GGT TAT CAT ACA TTG GTC GGA GAA AAC GGA AGT TCA CTT TCT 

KLPDGYHTLVGENGSSLS
GGA GGA GAG CGA CAA CGC TTA TCC ATA GCA CGG GCT TTT TTG AAA AAT GCG CCA 

GGERQRLSIARAFLKNAP
ATT TTA ATT TTA GAT GAA ATT ACT GCC AGT TTA GAT GCG GAA AAT GAA AAA CGT 

I LI L DE I TAS LDAE N E KR 
ATT CAG GAA AGT TTA AAT CGT TTA ATA CAA GAT AAG ACA GTA CTA ATT ATA TCG 
IQESLNRLIQDKTVLIIS

CAT CGC TTG AAG TCT ATT GAA AAG GTA AAT AAA ATT GTA GTG ATG GAT CAA GGA 
HRLKSI EKVN KIVVMDQG 

AAA GTA GTT GAT CAA GGT ACT CAT AGT GAG TTG TAT CGA AGG TCA GAA ATA TAC 
K V V D QGTHSELYRR S E IY 

AAA AAT CTC ATA AAG AAA ACT AAA CTA TCT GAG AAA TTC GTC TAT GAA AAA GAA 
KNLIKKTKLSEKFVYEKE
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GCT CAA TCA CGA TGA 3' 
A Q S R * 



Fig. 15C 




5' ATG ACT AAA AAA ATA TGG TCG CAC TTT CGG AAA AGC TTC ACT TGG TAC ATT TTG
MTKKIWSHFRKSFTWYIL
ATT GGC GTG ATT GTT TCA TTA TGT AGT GCT TTA GCA ATT TAT TTT TTT CAA CAG

IGVIVSLCSALAIYFFQQ
TTG TTA GAT CAT TAT CAA AAA AGT TTC CAA CTA GGG TTG TTA GTA GCG TAT GGA 

LLDHYQKSFQLGLLVAYG
ACG ACA ATT ATT TTA ATT CCT CTA TTG TCT TAC TGT GAA CAG AAA CCG AAA GCT 

TTIILIPLLSYCEQKPKA
TAC TTA ACT AAT GGC ATC TAT TTC TAT TTG AAG AAA TTA AGT TTA ATA AAA ATG 

YLTNGIYFYLKKLSLIKM
AGC AAG ATT TCC TAT GAA GAA TAT CTA AAA CTG GGA GCG GGG GCT TTA CTA CAA 

SKI SYEEYLKLGAGALLQ
AAA GTA GAA GTT GGT GCA GGA GCT GGG AGA AAT ATC CAC TTG AAT TTT TAT GGG 

KVEVGAAAGRNIHLNFYG 
CGC TTA TTT CGA GAG TTA ATT CCA GAG ACG TTA TTT AAT TTA TTT TTC ATA GCA

RLFRELIPETLFNLFFIA
TTG ATT GAT AAA AAA TTA TTA CCA GCA ATT CTA ATT GGT TAT GTG ATT GTA TTT 

LIDKKLLPAILIGYVI VF
ATT TTA ACC AAA ATA CTT TTG AAA ACG CTA CAA AAA ATG AAA GAA AAA ACA CTT 

IL TKILLKTLQKMKEKTL
ATT TCT GAA GAA GCA ATG AAT GCT ACT TTG ATA CGA GGG ATG ACC GAA CTA GTA 

I S E E A M N A T L I R G M T E L V
ACT TTT AGA ATT AAT CGA AAA TAT AAA AAA GAG ATT GAA AAT TAC GCG TTA ATG 

TFRINRKYKKEIEN YALM
GCA GAG GAA AAT AGT CAA AAT ATA ACA AAA ATG ACT ATG ATC CAT GAA TTT TTC 

AEENSQNITKMTMIHEFF 
TTT GGC TTT TTT GCC TTG TTA GTT GCA CTT ATC AAA GTA AGT ATT GTC GTA CTT 

FGFFALLVALIKVSIVVL
AGT TTT ACT AAC GTC GTG ACG CTA AGT TTA GGT GGA CTT GTA GCG ATA GTG ATG 

SF TNVVTLSLGGLVAIVM 
TAT ATC GAT CGA ATT TAT ACG CCG ATA GCC ATA TTT AAT GTT TTA TTT GTT CAA 
Y I D R I Y T P I A I FNVL FVQ
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TAT AAT TTG GAT AAA GTA GCC TAT CAG CGC TTG GAA GAT TTT TAT AAA AAA GAA 
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TGG CCA ATG TTT GCG ATT GGG CGC TEA TTT AAT GTG TTA GAA CGT GGG AAT GCT 
W P M F A I G R L F N V L E R G N A
AGT TAC GAT CCT GTG AAC GAA TTA TTA CAT GAA AAA ACG CAT ATT ATC GAA CGA 

V V D _ R V N E L L H E K T H I I BR 
AAA GAT GCC ATC AAA ACA ATG GCG CAA GGG ACA ATT TCA ATG AAG ATT GAT TCT

KDAI KTMAQGTISMKI DS

TTT TCT TAT CCA AAA GAA GAG ACG GTG GCG TTG GAA AAT ATC CAA TTT TCG TTG 

F S Y P K E E TVAL EN I Q F S L
CAA GAA GGA GAA ACC TTG GGG ATT GTT GGC AAA ACA GGC GCT GGT AAA ACC ACT 
Q E G E T L G I V G K T G A G K T T
ATT TTG AAA TTG TTG ATG CGT GAA TAT GAC CAA TAC CAA GGA ACG ATT TCC TTT

I L K L L M R E Y D Q Y Q G T I S F
GGA AAA CAT AAC ATT AAA AAT TAC ACA TTA GAT GCA TTG ATG GGT GCA ATG GGG 

GKHNI K NYTLDALMGAMG
TAC GTG CCA CAG CAT TTT CTC TTT TCC ATG ACG GTA CGA GAT AAC ATT CGT

Y V P Q D H F L F S M T V R D N I R
TTT GCT AAA CCA CAC TTG GAA CAA GCA GCA GTT GAA CAA GCG GCA GCA TTA GCA 

FAKPHLEQAAVEQAAALA

TTT ATT AAC CAA GAA ATT AAA GCA TTC CCT GAA GGC TAT GAC ACA ATG GTT GGG 
F I N Q E I K A F P E G Y D T M V G

GAA CGT G 3'
E R
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5 1 ATG AAA AAG TAC GTG AGT TCG CTA AAA ACT TCG TTT CAT TTC GTG TAT AAT AAG 

MKKYVSSLKSSFHFVYNK
AGA AAG AAA AAA ATC CAT TTA ATT AAG GGA GTA ATT TTT GAA TTC TTG GTT CGC 

RKKKIHLIKGVIFEF LVR 
TTA GGG TTT AAT AAT GGT AGA GAG AAA AGG TTG GAG GCA AAA GAA ATG AGC TTA 

LGF NNGRE KRLEA KEM S L
ACA ATT GAA CAT TTA ACA GGA GGT TAC GGC CAT ATT CCT GTC TTA AAA GAT ATT 

TI EHLTGGY GHI PVLKDI 
AAT TTT GAC GTC AAG TCT GGT GAA ATG GTT GGT TTA ATT GGC TTG AAT GGT GCG 

NFD VKSGEMVG LIGL NGA
GGG AAA AGT ACC ACC ATC AAA AAT ATT ATT GGG TTA CTT ACG CCG CAA AAA GGC

GKSTTIK NIIGLLTPQKG
AAA ATT ATG ATT GAC GGG GAA ACA TTG CAA CAA GCC CCT GAA GAA TAT CGT AAA 

K I M I DGETLQQAPEEYRK
AAA ATC GGG TAT ATT CCG GA*. ACA CCT TCT TTA TAT GAA GAA TTA ACA TTG AAA 

K I G Y I PE TPSLYEEL T LK
GAG CAT ATC GAA GTA ACG GCC TTG GCT TAT GAT ATT CCA TTA GAA GAA GCG TTC

EHIEVTALAYDI PLEEAF
AAA CGA GCA GAA CCA TTA CTA AAA ACG TTC CGT TTA GAC AAT AAA TTG GAA TGG

KRAEPLLKTFRLDNKLEW
TTT CCT GCT AAT TTT TCA AAA GGC ATG AAA CAA AAA GTC ATG GTA CTT TGT GCA 

FPAN FS KG MKQKVMVL CA
TTC TTA ATT GAA CCG AGT TTA TAT ATT ATT GAT GAA CCT TTT CTA GGC TTA GAT 

FLIE PSLYIIDEPFLGLD
CCT TTA GCA ATT CAT GCT TTA TTA GAA TTA ATG GAT ACG ATG CGT AAG CAA GGG 

PLAIHAL LELMDTMRKQG
GCA GCG ATT TTA ATG TCC ACG CAT ATT TTA GCA ACA GCT GAA AAA TAT TGT GAT 

A A I L M S TH I L A T A E K Y C D

CGT TTT GTG GTG CTA CAC GAA GGG AAA TTA CGG GCA AAC GGT ACA ATG GCT GAA 
R F V V L H E G K L R A N G T M A E

TTA CGT GCA GAA TTT AAT TTA CCA GAG TCT TCT TTA GAT GAT ATT TAT CTT GCC 
LRAEFNLPESSLDDIYLA
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TTG ACG AAG GAA QAA AAG GTG GG6 TAA 3 1 
LTKEEKVG* 
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5' ATG AAA AAA TTA AGT ATT CAT CTG AAA GAC GTA AGT ATT CAC TTT TCG GGA AAA 

MKKLS IHLKDVS IH FSGK
CCA ATT CTA GAA ATT GAT GAA TTA TTT GTA TAT GAA AAT GAA AAA ATT GGG ATT 

PILEIDELFVYENEKIGI
ATT GGT AAA AAC GGC GCA GGA AAA TCA ACA CTG CTC AAT TTA ATT ATG GGT AAG 

I GKNGAGKSTLLNL IMGK 
ATT CAA TCA GAT AAA GGA AAA GTT CAA AGA TTG AAC GAC TTT CAT TAC TTG GCA

I QSD KGKVQRLNDFHYLA
CAA GTA GCT GAA GAA ATA ACG AAT GAA TCA GAG AAA ACT GAC AAA AAT TGT TTG 

QVAE E I TN ESEKTD KNCL 
CTA AAT CAG AAA AAT CAA AAA CTA AGT GGT GGC GAA AAA GTC CAA AAA CGC TTA 

LNQKNQKLSGGEKVQKRL
GCA ACA TTA TTT TCA GAG TAT CCA ACC GGG GTT ATT TTA GAT GAA CCA ACC ACA 

ATLFSEYPTGVILDEPTT
CAT TTA GAT AAA GAA CAT CGT CAG TTG TTA GTG GCA GAT TTA ACG TAT TAT TAT 

HLDKEHRQLLVAD V TYYY 
GGG ACC GTT TTG TTT GTT AGT CAT GAT CGC TTT TTC TTG AAT CAA TTA GCA GAG 

GTVL FVSHDRFFLNQLAE
AAA ATT TGG GAA GTT TCG GAT GGA CAT GTC AAA GAA TAT TTA GGG AAC TAG GAT 

KI WEVSDGHVKE YLGNYD
GCG TAT TGT CGT CAA AAA GAA TTG GAA CAG CAA ACA CAA TAT AAT GTC TAT CAT 

AYCRQKELEQQTQYN VYH 
CAG TAT CAA AAG GAA AAG AAA AAT TAC AGG AAT CTT ACG CAA AGA AAC AAG CAC 

QYQKEKKNYRNLTQRNKH
AAG CGC AAA AAT CTA GTC ATG TTT CAA AAA AAC AWA AWC AAA AGC ARA TTA AAC 

KR KNLVMFQKNXX KSXLN 
CTA GTC GTT TAG 3'

L V V * 
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5 t ATG ATT TTA TTA CAA GCA AAT CAA GTT GCC CGG CAT TTT GGC TCG GAA ACA TTG 
MILLQANQVARHFGSETL 
TTT GAA AAC ATA CAT TTA GAA ATT GCA ACA AAA ACT CGG ATT GCC TTA GTT GGT

FENIHLEIAT KSRIALVG
CGT AAT GGT GCT GGA AAA TCG ACT TTT TTA AAA ATC ATT GCA GGC ATT GAT GCT 

RNGAGKSTFLKIIAGIDA 
CCT GAT AGC GGA ACC ATT GCC AAA AAT AAA ACT GCT ACG TTA GGT TAT TTA GCT

P D S G T I A K N KTAT L G Y L A
CAA AAT ACC GGC TTA GAA TCA GAT AAA ACC GTT TGG GAA GAA ATG ACA AAA GCC 

QN T GLE SDKTVW E E M T KA
TTT GCT GAC ATC CTA GAA ATG GAA CAG CGT ATG CGA GAA TTA GAA ACT AAA ATT 

FADILEMEQRM RELETKI
AGT GAA ATG GAG CCA ACC ACT TCC GTT TAT GAA GGA ATT TTA AAA GAG TAC GAT 

SEMEPTTSVYEGILKEYD
CAA TTG CAA CAT ACG TTT TCT GAA AAA AAT GGC TAC GGC TAT GAA AAT G*£ ATT
Q L Q H T F S E KN G YG Y E N E I
CGC TCA GTC CTT CAC GGC TTT GGC TTT GAT GAA TCC TTT TAC ACG AAA GAT ATT 

RSVLHGF GFDESFYTKDI 
CAA ACC TTA TCT GGT GGT CAA AAA ACC CGG CTT GCA TTA GCG AGA ATG CTT TTA 

QTLSGGQKTRLALARMLL
CAA AAA CCA GAC ATT TTA ATT CTG GAC GAG CCT ACA AAC CAC TTA GAT ATC GAG 

Q K P D I L I LDE PTNH LD I E
ACG CTT TCT TGG CTG GAA TCT TAT TTG CCA AGT TAT GCC GGC GCC CTA TTA ATT 

TLSWLESYLP SYAGALLI
GTT TCC CAC GAT CGT TAT TTT TTA GAT AAG GTA GTT AAT GAA GTT TAT GAA CTG 

VSHDRY FLDK VVNEVYEL
AGT CGC AAA AAA ATG ACT CAC TAC AAA GGA AAC TAT TCC AAA TAC TTA GAG TTA 

SRKKMTH YKGNYS KYLEL
AAA GCA GAA CAA TTA GCC AGT GAA TGG AAA GCG TAT GAA AAG CAA CAA GAA GAA 

KAEQLASEWKAYEKQ QEE
ATC AAT AAG TTA GAA GAT TTC GTT GCC AAA AAT CTG GTT CGT GCA TCT ACA ACG 
I NKL ED FVAKNLV RAS T T 
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TGG TTG GAA TCA AGC CAA CTA GCG ACA AAA TTG CAA GCA CAA TCT AAT GCC AAA 

WLESSQLA TKLQAQSNAK 
AAA GAA GAA CAA ATC AAA GAA TTA CAA GAC TTT ATC GCT CGT TTT AGC GCC AAT 

KEEQIKELQDFIARFSAN 
GCC TCA AAA TCA AAA CAA GCA ACG TCT CGT AAA AAA ATG TTA GAT AAA ATT ACC 

AS KS KQAT S R K KM LD K I T
TTA GAT GAC ATT CAA CCT TCC TCT CGT CGT TAT CCA TTC GTT GGT TTT ACA CCA 
LDDIQPSSRRYPFVGFTP

GAA CGT GAA ATC GGC AAT GAC TTA TTA CAA GTT GAA AAT GTC TCT GTA ACG ATT 
EREIGND LLQVENVSVTI 

GAT GGC AAA AAA ATC TTA GAT AAT ATC TCG TTC AAC TTA ACA AAA GAT GAT AAA 
DGKKILDNISFNLTKDDK 



Fig. 25A 






GTG GCA TTC ATC GCT GAC TCA GAT ATT ACA ACS ACT ACQ TTA TTC AAA GTG ATT 
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5 ' ATG TCG AAA ATT GAA CTA AAA CAA CTA TCT TTT GCC TAT GAT AAT CAA GAA GCG 
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AAA CTA TTA AAA GAT CTT GAG TAT ATT GAT TCT TTG TCA ATG GAT TAT CAG CCA 

K LLKDLEYIDSLSM DYQP
ACG CAT CAC AAA ACA TTA TTG ACG GTG GAA GAG CTT CGT CTA GGC TAG GAG AAA 

THH KTLLT VEELRLGYEK
AAT TGG CTG TTT GCG CCA ATT TCT TTT TCA ATA AAC GCG GGA GAA ATT GTC GGA 

NWL FAP I S FS INAGE I VG
ATA ACA GGA AAA AAT GGC TCA GGA AAA TCG AGC TTG ATT CAG TAT TTA TTG GAT 

I TG KNGSGKSS L I Q Y L LD 
AAT TTT TCT GGG GAT TCA GAA GGC GAA GCC ACT TTG GCT CAC CAA TTA ACC ATT 

N FS GDS EGEATLAHQLTI
TCT TAT GTG CGC CAA GAT TAT GAA GAC AAT CAA GGA ACT TTA TCC GAA TTT GCA 

SYVRQDYEDNQGTLSE FA 
GAG AAA AAT CAG TTA GAT TAC ACC CAA TTT TTA AAT AAC TTA CGA AAA CTT GGG 

EKNQLDYTQ FLNNLRKLG 
ATG GAG CGT GCC GTT TTC ACT AAT CGA ATT GAA CAA ATG AGT ATG GGG CAA CGG 

MERAVFTNRIEQMSMGQR 
AAA AAA GTC GAA GTA GCC AAA TCA TTG TCT CAA TCA GCA GAG CTT TAT ATT TGG 

KKVEVAKSLSQSAELYIW
GAT GAA CCC CTT AAT TAC TTG GAT GTG TTT AAT CAT CAA CAA TTA GAA GCG CTA 

DEPLNY LDVFNHQQLEAL 
ATC TTA TCT GTG AAG CCT GCA ATG CTA GTG ATT GAG CAT GAT GCG CAT TTC ATG 

I LSVKPAMLVI EHDAH FM 
AAG AAA ATA ACG GAT AAA AAA ATT GTC TTG AAA TCA TAA 3'

KKI TDKKIVLKS* 
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5' ATG CCG TTT ATT CCA GTA TAT GTT GAA CAA TTA GGG ACG CCT AAG AGT CAA GTA 

MPFIPVYVEQLGTPKSQV 
GAG TTG TTT TCT GGA TTG GCA ATT TCT GTC ACG GCG TTT GCT GCT GCG ATT GTT 

ELFSGLAI SVTAFAAA IV
GCA CCG ATT TGG GGA AAT TTA GCT GAT CGA AAA GGT CGG AAA ATT ATG ATG ATT 

APIWGNLADRKGRK IMMI 
CGG GCT GCA GCA GGA ATG ACG ATC ACA ATG GGG GCG TTA GCA TTT GTC CCT AAT 

RAAAGMTI TMGALAFV PN 
GTA TAC TGG CTG TTG ATT ATG CGT TTT ATC AAC GGG ATT TTA TCC GGC TAT ATT 

VYWLLIMRFINGILSGYI
CCT AAC GCA ACA GCG ATG ATT GCG TCG CAG GCG CCA AAA GAG AAA AAT GGC TGG

PNATAMIASQAPKEKNGW
GCT CTA GGG ACA TTA TCA ACA GGT GCA ATT GCT GGG ACG TTA ATT GCG CCA TCG 

ALGTLSTGAIAGTLIAPS
ATT GGC GGT GCG TTG GCA CAG TGG TTT GGA ATG GAA AAT GTC TTT ATT ATT ACC 

IGGALAQWFGMENVFI IT
GGC GTT ATT TTA TTC ATT ACA ACG TTG TTG ACT ATC TTT TTA GTT AAA GAA GAT 

GVI LFITTLLTI FLVKED
TTT CAA CCA GTT GAG AAA AAG GAT TTA TTA ACG ACG AAA GAA ATT TTT AGT AAG 

F QPVEKKDLL TTKEIFSK
ATG GAT CAT GTT TCA GTA TTG ATT GGT TTG TTT GTG ACT ACG TTA ATT TTA CAA 

MDHVSVL IGLFVTTLILQ
TTA GGA ATT ACA AGT ATC AGC CCA ATC TTA ACG CTA TAT ATT CGA TCT TTA AGT 

L G I TSISPILTLYIRSLS 
GGC GAT ACA GAG AAT GTA TTA TTT GTT AGT GGT TTG ATT GTT TCG ATT GCC GGC 
GDT ENVLFVSGLI VSI AG

GTT TCA GCA ATT ATT TCT TCT CCA ACT TTG GGA AAA ATT GGG GAC CGC ATT GGG
VSAIISS PTLGKIGDRIG 

AAC CAA AAA GTT TTA TTA GGC GGA TTA ATT CTT TCT TTT ATT TGT TAC ATT CCC
NQKVLLGGLI L SFICYIP

ATG GCG TTT GTG CAA ACA CCT TTT CAG TTA GGT GTT TTA CGA TTC CTT TTA GGT
MAFVQTPFQLGVLRFLLG 
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TTT TCA ACA GGT GCT TTA ATG CCA TCA ATT AAT ACQ TTA ATT ACT AAA ATA ACG 

FS TGALMPS I N T L I S KIT
CCA ACA GAG GGC GTA AGT AGA GTT TAT AGT TAC AAT CAG ATG TGT ACT AAT TTC 

PTEGVSRVYSYNQ MCSNF
GGT CAA GTT TTA GGA CCG ATG GTA GGG TCG ACA GTT GCC CAT GGT TTT GGC TAT 

GQVLGPMVGSTVAHGFGY 
TCT TCT GTT TTT CTG GTA ACC GCT TGT TTC GTT TTA GGA AAC ATT GGG CTG TCC 

S SVFLVTACFVLGNIGLS
TTC TTT AAT TTC CGA AAA GTT TTA AAT AAA AAG CTC TAA 3'

F F N F R KV L N K K L *
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5 • ATG GCT GTT TTT TTA TGG AGG AGG CAA ATT ATG ACG AAA AAA AAT AGT ATG ATG 
MAVFLWR RQIMTKKNSMM
TAG TTA GCA ATT TCT AAC TTA TTT CTT GTT TTT CTA GGC GTA GGC CTA GTC ATT 

YLAI SNLFLVFLGVGLVI
CCC GTA ATT CCC CAA TTA AAA GAA GAA ATG CAT TTT TCT GGT ACC ACA ATG GGA 
PVIPQLKEEMHFSGTTMG

ATG ATG ATT TCT ATT TTT GCG ATT GCC CAA TTA ATC ACA TCG CCA ATC GCA GGT 

M M I S IFAI AQLI TS P IAG 
GTC CTT TCG GAT AAA ATT GCT CGG AAA AAA ATG ATT GCA ACG GGC ATG TTG GTG
V L S D KI GR KKM I A T G M L V
TTT TCA ATT TCT GAG TTA TTA TTT GGT TTA GCC CAA GCG AAA AGC GGT TTT TAT 

F S I S ELLF GLAQAKS G FY
ATT TCT CGT GGT TTA GGC GGG ATT GCC GCC GCT TTA TTA ATG CCG TCA GTG ACA
I SRGLGGIAAA LLMPSVT
GCC TTT GTG GCA GAT ATG ACC ACG ATT TCT GAA CGT CCG AAA GCG ATG GGG CTT 

A FVADMTT I S ERP K A M G L 
GTG TCA GCC GCA ATT AGT GGT GGT TTT ATT ATC GGA CCA GGA GTT GGT GGT TTT

V S A A I S G G F I I G P G V G G F
ATT GCT TAT TTA GGT ATT CGC GCC CCG TTT TTT GCA GCC GCA TTT TTA GCG TTT 

IAYLGI RAPFFAAAFLAF
ATT GGT TTT ATT TTG ACA TTA ACT GTT TTG AAG GAG CCA GAG AAA CGA ATT TTA 

IGFILTLTVLKEPEK RIL
GCC GCT GTT GAA GCG AAA AAA GGT TCA TTT ATG GAT ATT TTA AGA AAT CCA ATG 

A A V E A K K G S F M D I L R N P M
TTT ACC TCA TTA TTT GTG ATT ATC TTA ATT TCC TCT TTT GGC CTG CAA GCG TTC 

FTSLFVI I L ISS FGLQAF 
GAA TCT ATT TAT AGT ATT ATG GCG ACT ATT AAT TTT GGC TTT ACC ACA AGT GAA 

E S I YSIMATINFGFTTSE

ATA GCA ATC GTG ATT ACG GTT AGT GGT ATT TTA GCG TTG ATT TGT CAG TTG TTT 

IAIVITVSGILALICQLF
TTC TTT GAT GCA ATC GTC CAA AAA ATA GGT GAA ATG GGT TTA ATC CAA TTA ACC 

F F D A I V Q K I G E M G L I Q L T
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TTT TTT GCA AGT GCC ATT TTT ATT GCC GTG ATT GCC TTT ACA AAA AAT AAT TTA 

F F A S A I F IAVIAFTKNNL
GTT GTT GTA TTT TCA ACG TTT ATT GTC TTT TTA GCG TTT GAC TTG TTT AGA CCA

VVVFST FIVFLAFDLFRP
GCA GTA ACT ACT TAT TTA TCC AAA CAT GCT GGA GAT CAA CAA GGA ACC ATC AAC 

AVTTYLSKHAGDQQGTIN 
GGA CTA AAT TCG ACA TTT ACA AGT TTT GGT AAT ATT TTA GGA CCA ATG GCA GCA 

GLNSTFTSFGNILGPMAA
GGA GCT TTA TTT GAT ATC AAT CAC TTT TTC CCT TAT TAT GTT TCA GCA GTA ATT 

G ALFDINHFFPYYV SAVI
CTG TTA GGA ACG GGC TTT TTA TCG TTA TTT TTA AAT CGA AAT AAG ATG TAA 3'

LL GTG F L S L F L N R N KM
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5 ' ATG ATC GAT AGA AAA AAA GTT ATT TTG TAT ACT TGT TGC ATG AGT TTG TTT 6TA 
MIDRKKVILYTCCMSLFV 
GTG ACC ATG GAT GTT ACG GTT GTC AAT GTG GCG TTA CCA TCC ATT CAA AGT GAT 

VTMDVTV VNVALPSIQSD 
TTT CAC ACG AAT CTG TCT ACA TTA CAG TGG GTA ACA GAT GGC TAC ACT TTA ATG

FHTNLSTLQWVTDGYTLM
GTA GCA TCC TTA TTA TTG TTG TCT GGG TCT ACA GCA GAT CGA ATT GGC CGT AAA 

VASLLLLS GSTADRIGRK 
CGA GTC CTT CAA TTG GGC TTA GCC TGT TTT GGT TTA GCG TCT TTC CTA TGT GGG 

RVLQLGLACFGLAS F L C G

ATT TCG CAA ACC CCA GGG CAA TTG ATC GCG TTT CGC ATG TTG CAA GGG ATT GGT 

ISQTPGQLIAFRMLQGIG
GGT TCT ATG TTA AAT CCG GTA GCA ATG TCT ATT ATT ACA CAG GTA TTT ACC GAA 

GSMLNPVAMS I ITQVFT E
AAG TTA GAA AGA GCG AAA GCA ATT GGC TTG TGG GGC TCT GTT ACA GGG ATT TCC 

KLE RAKAI GLWGSVTGIS
TTA GGC ATG GGC CCG ATT ATT GGT GGA CTG ATT GTT TCT TAT TTT AGC TGG CGG 

LGMGPIIGGLIVSYFSWR 
TAT GTC TTT TTT GTA AAT GTA CCG ATC ATT GCT GCG GCA ATC ATC CTT ACA CAA 

YVFFVNVPI I A A A I I L T Q
AAG TTT GTA CCT GAG TCA AAA GTA GAG AAG ACG GTG AAA AAT GAT TTT GTT GGT 

KFVPESKVEKTVKND FVG
CAA GCA TTG ATG ATT CTT TTT CTA TTT AGT TCT ATC TAT TCC ATT ATC GGA CTA 

QALMILFLFSSIYSIIGL 
CCT AGA AAA GGG CTT TTC GCG CCA GAT ATT TTA AGT ACT GGG ATA ATC GGC TGC 

PRKGLFAPDILSTGI I G C 
TTA GCC ATT GTT ATT TTC TTT ATT TAT GAA TAT AAC ATT GAC AAT CCG TTA ATC 

L A I V I FFIYEYNIDNPLI
AAC CCG CGT TTC TTT TTA TCT ATT CCA TTT ACA TCG GCT TCT TTT TTA GCT ATT 

NPR FFLS I PFTSAS FLAI 
TTT GGC TTT ATC ATA TAT AAC GGC TAT TTA TTT TTA AAC ACG CTA TAT TTG CAA 
FGFI IYNGYLFLNTLYLQ
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GAG ATG AGA GGC TTC AGC CCG TTG GAA GCT GGC TTA TCA ACC ATT CCT TTG GCT 

EMRGFSPL E A G L S TI P L A 
TTT GTT ACT TTT CTT GTT GGA CCG AGA GCC GGC GAA ATG GTA GGG AGA ATA GGG 

F V S FLVAPRAGEMVGRIG 
ACG AAA CGT CCT ATT ATG CTT TGT GGT ATT TCA ATG TTG GCT GTT AGC TTT TTA 

TKR P I MLC G I SM LAV S FL
CAA TTA TTT GTA ACT AAA AGA ACG CCT ATG ATT ATT TTA TTT ATT ATT TAT ATC 

QLFV T KTTPMIILFIIYI 
TTT TTA GGC ATT GGG TTT GGG ATG TTA AAT TCA CCG ATT ACC ATT ACA GCG ATT 

FLG I GFGMLNSP I TI TAI 
GAA GGA ATG CCA CTT TCT CAG TCA GGA ACT GCA GCA GCC ATT GCG GTG ACA TGT 

EGM PLSQS GTA AAIAVTC 
AAG CAA ATT GGC AAT TCT TTA GGG GTG GCG CTA CCA AGT CTT TTA ATT ACA AAG 

K Q I GNSLGVALPSLLITK 
CCT ATT ACT AGT TCG CTT ACT CGA ACA CCT TTT ACA AAC GTA TGG CTT TTA TTT 

PITSSLTRTPFTNVWLLF
GGA TGT TGC GCC ATT GCG ATT ATC TTT TTA AGT TAT T 3 ' 

GCCAIAIIFLSY 
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5* ATG GCA AAA GAA ACA AAT GTT AAG TTA GTC ACG GTG ACT GTT TTT GTG GCA ACA 
MAKE TNV KLVTVSVFVAT 
TTT ATG ACA GCC ATT GAA GGG ACC ATT GTG TCT ACT GCG ATG CCA ACG ATT GTC 

FMTAI EGTIVSTAMPTI V
GGC TCG TTA CAT GGC ATG GAA ATT ATG AAC TGG GTA TTT TCA ATT TAT TTA TTA 

GSLHGMEIMNWVFSIYLL
ACG AAT GCG ATG TTA ACA CCG ATT TAT GGG AAA CTT GCG GAT AAA ATT GGT CGT 

TNAMLTP I YGKLADKI G R 
AAA CCT GTC TTC ATG ATT GGC ATT ATC ATT TTT ATT TTG GGC TCC TCG TTG TGT 

KPVFMIG IIIFILGSSLC 
GGC TTT GCT CAA GAT ATG TTG ACT TTA ATT ATT GCC CGC GCA ATT CAA GGT GTG 

GFA QDMLTLI IARAI QGV 
GGG GCA GGC GCA ATT TTA CCA GTT GCG TTA ACG ATT ATT GCC GAT ATG TAT ACA 

GAGAI LPVALTI IADMYT 
TTG GAC AAG CGA GCG AAA ATT TTA GGT TTA AAC ACT GCC GCC TGG GGA ATT GCT

LDKRAKILGL NSAAWGIA
AGT ATT TTT GGT CCG TTA GCA GGT GGT TTT ATT GTA GAT ACA GTC GGT TGG CAT 

SIFGPLAGGFIVDTVGWH 
TGG ATT TTC TTC ATT AAT GTT CCT ATT GGA CTT GTT TTA TTG GGC TTG ATT AGT 

WIFFINVPIGLVLLGLIS 
ATT TTC TTA GTT GAA CCA AAG GGG GAA CGG ACC AAG ATG CCA ATG GAT ATT TTG 

I FLVE PKRERTKM P MD I L 
GGC AGC GTT ACT TTG ATG GTA GTG CTG CTA ACG TTA TTG CTA GGT TTT CAA ATG 

GSVT LMVVLLTLLLGFQM
ATT AGC GAT AAT GGT TTT ACA TTA GTA ACA TTT GGT TGT TTA AGT TTG AGT GTG 

ISDNGFTLVTFGCLSLSV
CTC TTT TTT GTA GCA TTT GTG ATG ATA GAA AAA CGC GCG CAA GAC CCA GTG ATT 

LFFVAFVMIEKRAQD PVI 
GAT TTG CAT TTA TTT AAT CAA CCA ACG TTT GTT TTA GTA AAT CTT ATT GCA GCG 

DL HLFNQPTFVLVNL I AA 
CTT ATT AGC GGT TTC TTA ATG GGG ATT GAT GTC TAC ATT CCG ATG TGG ATG CAA
LISGFLMGIDVYI PMWMQ
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GGT GTC TTA GGA AAA ACT GCA GGA ATT GGT GGC TTA GTT TTA GOG CCT ATG TOG 
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5 1 ATG GCT GGT ATT G6C TTT AGC TTA GTG ATG CCA TTT ATG CCT CTT TAT ATC AAC 

MAG I GFSLVMPFMPLY IN
ACA TTA GGT ACT TTT ACC CAC CAA CAA TTG AAT TTT TGG AGC GGA ATC ACG TTC 

TLGTFTHQQLNFWSGI TF 
TCT TCC ACT TTT TTA GTC ACA ACC ATT GTT TCA CCT TGG TGG GGC CGT TTA GCC 

SSTFLVTTIVS PWWGRLA 
GAC CGA AAA GGA CGC AAG TTA ATG TTA TTA CGC GCT TCA TTA GGG ATG GCT ATC 

DRKGRKLMLLRASLGMAI 
GTC ATT AGT TTA ATG GGC GCA GTC ACA AGT GTT TAT CAA TTG ATT GGC TTA CGC 

VISLMGAVTSVYQLIGLR
TTA TTA CAA GGT GTT TTT TCT GGT TAT ATC AGT AAT GCG ACT GCG TTA GTT GCT 

LLQGVFSGYISNATALVA 
ACA GGC ACA CCT AAG GAA AAA AGC GGT CAA GTG CTT GCT ACT TTA GCC ACT GCT 

TGTPKEKSGQVLGTLATG
TCT GTT ACC GGC ACG TTG CTT GGT CCT CTT TTG GGC GGT GTC ACT GCG TCT ATT 

SVTGTLLGPLLGGVTASI 
TTC GGT TAT CGG CCA ACC TTT TTC ATT ACT GGT ACG ATT TTA TTA CTC GOT TTT 

FGYRPTFFITGTILLLVF 
GTT TTG AGT CTC GTC TTT GTC CAT GAA GAG TTT GTG CCA ATT GAA AAA AAT CAA 

VL S LVFVHEE FVP I E K N Q
GCC GCA TCT GGG AAA CAA ATT TTA AAA AAA CTA GAA CAT CCC CAC GTG ATT CTT 

AASGKQILKKLEHPHV I L
GGA ATG TTT ATT ACG ACA TTA ATC ATT CAA GCT TCC AAT AAT TCA ATT AGT CCC 

GMFITTLI I Q A S N N S I SP 
ATC ATC AGT TTA TAT ATT CAA CAA TTG TTG GGT GGT CAC GGA AAT GTC ACC TTA 

I I S LYIQQLLGGHGNVTL 
ATT AGC GGA GTC ATT GCT TCT ATT CCA GGG ATT GCA ACG TTA ATA GCC GCC CCT 

I S GV IAS I P G I A T L I A A P 
CGT TTC GGC CGG TTA GGC GAT CGT ATT GGC AGT GAA CGC ATC TTG ACA ATT GGC 

RFGRLGDRIGSERILTIG 
TTA ATT TTA GCC ATT TTT GTT TAC CTA CCA ATG GCC TTT GTT CAA AAT GTC TGG 

L I LAI FVYLPMAFVQ NVW 
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5' ATG CAA TTT AAT TTG AAA CTT ACA GAA AAA GGA GCG ATA TAT GTG ACT TCA CAA 
MQ FNLKLTEKGAIYVTSQ
CAA CCT GTG GAT ATT CAC GGA AAA CCT TAT AAC CGG TCG TTA CTA GTC GGT GTT 

Q PVD I H G K P YNRS L LVGV
TTA CTA ATC GGA ACA TTT TGT ACG ATT TTA AAC CAA ACG CTT TTA ACG ACG GCC

LLIGTFCTI LNQTLLTTA
TTA CCT ACA TTA ATG AAA GAA TTT GAT ATC TCA GCT TCA ACT GTA CAG TGG CTA 

LPTLMKEFDISASSVQWL
ACC ACT GGA TTC CTA CTT GTG AAC GGC ATT ATG ATT CCC ATC ACT GCT TGG TTG 

TTGFLLVN GIMIPISAWL
ATT AAC AAA TTC AGT TCA AAA AAA TTA TAC ATA ACA GCG ATG TCA ACG TTC TTA 

INKFSSKKLYITAMSTFL 
ATC GGA ACC ATC ATC TGT TTT GTA GCG CAA GAC TTC GGC ATG TTA TTG ACT GGA 

I G T I I CFVAQDFGMLLTG
CGT CTT GTT CAA GCC GCA GGG GTC GGT GTA TCT ATG CCT CTT TTA CAA ACA ATT 

RLV QAAGVGVS MPLLQTI
ATG CTT TCA ATT TTC CCG CCT GAA AAA CGT GGC GCT GCC ATG GGC ACA ACG GGA 

MLSIFPPEKRGAAMGTTG
ATT GTT ATC GGA TTA GCA CCC GCT TTA GGT CCA ACA TTA TCT GGT TGG ATT ATT 

I V I GLAPALGPTLS GW I I
GAT TCT TAT ACA TGG CGT GAC CTT TTT GGG ATG GTC ATT CCA ATT GTC GTT TTA 

DSY TWRDLFGMV IPIVV L
GTC CTA ATT TTA GCT TCA TTT TTA ATG AAA AAT GTC ATT CAA TTA TCT AAT CCA 

VLI LASFLMKNVIQLSNP 
AGT ATC GAT GTC TTA TCG GTC ATT CTT TCT ACT CTC GGC TTT GGT AGC TTA CTT 

SIDVLSVILSTLGFGSLL 
TAC GGT TTC TCA AGC GTT GGT GAT AAA GGT TGG GGT AGC CCG CAA GTC TAT GGA 

YGFSSVGDKGWGSPQVYG 
TTC TTA ATT GTT GGT GCG ATT GTT TTA TGT CTT TTT ACA TAT CGA CAA TTG CAT 

FLIVGAI VLCLFTYRQLH 
TTA GAA CAA CCT TTC TTA GAA CTA CGC GTT TTT AAA TCA AAA GTC TTT ACT GTT 
LEQPFLELRVFKSKVFTV 
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GOG GCC ATT TTA TCT GGC GTA ACC AAC ATG GCA ATG ATT GGC GCC GAA ATG GTT 

A A I L S G V T N M A M I G A E M V
TTA CCT TTG TAT ATC CAA AAT ATT CGT GGC GAA TCT GCC TTC CAT TCA GGG TTA 

L PL Y I QN I RG ESA FH S G L
ATG CTT TTA CCA GGA GCG TTA GTC ATG GGA CTT ATG ATG CCA GTA ACT GGT CGC 

MLLPGALVMGLMMPVTGR 
ATT TTT GAT AAA ATT GGT GCC CGT CGT CTT GCG ATT ACT GGG ATG TTT ATT TTA 

I FD K I GARRLAI T GM F I L
ACA GCT GCA ACG TTA CCG TTT GCT TTC TTA ACA AAA GCA ACA CCT ATT ATT TAT 

TAATLP FAFLTKA TP I IY 
ATC ATT GTC CTA TAT GCG ATT CGG ATG TTT GGT ATT TCG ATG GTC ATG ATG CCT 

I IVLYAI RMFG I S MVMMP
GTG ACA ACT TCT GGC ATG AAT GCT TTA CCA ATG AAT TTA TTA AGT CAC GGG ACT 

V T T S G M N A L P M N L L S H G T 
GCT GTA AAC AAC ACC TTC CGA CAA GTA GCC AGC TCA ATT GGA ACA GCT GTT TTG 

AVNNTF RQVASS I GTA VL 
ATT AGT GTT TTA ACA AAT GTT ACC AAA GAC GGC TTA CCT GCA TCG GAC TTA TTA 

I SVLTNVTKDG LPAS DLL 
AAA ACA GCC CCA CTC ACT TAT CGC GAT CAA GCA ACA AAC GCA ACA CTA AAC GGT

KTAPLTYRDQATNATLNG 
TAT CAC GCA GCC TTC TTC GTG GCA ACA ATC TTT GGC GTT CTC GGC TTG GCG ATT 

YHAAFFVATI FGVLGL AI 
ACT TTC TTC TTA AAT AAA AAA GAA GCG ATG CCT GTT AAG GAA GTA GGT GCA ATG 

TFFLNKKEAMPVKEVGAM 

AAA TAA 3'
K * 
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5* ATG GAG AAA ATT AAA AGT TAC CAA GAG GAT TCT GAA GTG CAG AAA AAT CGT TGG 



M E K I K 
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I V N I A 
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V7^*Y? 4wv w*\* 
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I G S L L C G P N 


Q 8 L S 
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P L L F A 


RVVQSIGAS 


M T M A 


ACC AAT TCA GCA ATT ATT ACG GAA GTA TTC CCT TTA AAT GAA AGA GGG CGG GCG 


T H S Q . I 
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R G R A 
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K I F T I* 
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TTT GTA AAT GTT GTC ATT CCA TTT TAT TTG CAQ GAT GCA CGC AAA ^A^AGT GCT 

FVNVVIPFYLQDARKLSA 
ACT* TAT GCT GCT CTA TTA ATG ATG GTA TTT CCG TTA TTA ATG GTG GTT GGG GCT 

SYAGLLMMVFPI.LMVVGA 
CCT CTG ACT GGC TAT TTG ACG GAT AAA ATT GGC CCA GGT ATT TTA ACA TTT GGC 

PLSGYLTDKI GPGILTFG
GGA TTG TTG CTC TTG TGC TGT ACG TCG TTA ATG TAT ATG TTT TTA GAT ATG AAT 

GLLLLCCTSLMYMFLDMN
TCG CCT ATC TGG TAT TAT GTG ATT GCA ACG GCC ATT ATG GGC TTG GGA AAT GCA 

S PI WYYVIATAIMGLGNA 
CTT TTC CAG TCT CCA AAC AAT ACA ATG GTT ATG AGC ACT GTT GAA AAG CAA GAT 

LFQ S PNNTMVMS SVEKQD
TTA GGT GTA GCA GGA ACT ATG AAT TCT TTT GCT AGA AAC TTA GGA ATG GTC ATT 

LGVAGSMNSFARNLGMVI 
GGG ATT GCG TTG TCA ACG ACC ATT TTA TAT CGT GGC ATG AGC GAA GCC TAT GGT 

GIA LSTTILYRGMSEAYG 
GAA CGA GTA ACC ACG TAT CTG GCT AAT CGC CCA GAT ATA TTT ATT GTG GGA ATG 

E RV TTYLAN R PD I F I V GM
CGT GAA ACC TTT TTT GTC GCC TTT CTA TTA TGT GTG GCG GCC TTT ATA TTA ACG 

RET FFVAFLL CVA AF I LT
ATT TTA CGT TTT CGG AAA ACA ACC AAA TAA 3' 
ILRFRKTTK* 
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5 1 ATG GAA CAA AAA AAC GTA CGA TTA TTC CCT GCC GTT TTA GCG ACA GGA ATT ATG 

MEQKNVRLFPAVLATGIM 
TCT TTT GCT GGC GTA TTA ATT GAA ACA GCA ATG AAT GTA ACC TTT CCG ACA TTA 

SFAGVLIETAMNVTFPTL
ACA AAG GAA TTT GGT GTT TCC ACC GGT ACC GTG CAG TGG GTG ACA ACG ATT TAT

TKEFGVSTGTVQWVTTIY
TTA TTA GTT ATT TCT ATC ATG GTG CCC TTA TCC AAT TAT TTA TTA AAA ACC TAT 

LLV I S IM VPLSNYLL K T Y
TCG TTA AGA CGC TTA TTT ATT GTT GCG AAT CTT TTC TTT TTG ATT GGT TTA GCG 

SLRRLFIVANLFFLIGLA
ATT GAT GTC TAT TCG CCA TCT TTC AGT ATC TTG TTA TTA GGC CGA CTC TTC CAA 

IDVYSPSFSILLLGRLFQ 
GGG GCT AGC ACT GGG ATT GCT TTG CCA CTG ATG TTT CAT ATT ATT TTG AAC TTT 

GASTG IALPLMFHI I LNF
ACC CCG CTG GAA AAA CGA KGA ACG ATG ATG GGG GTA SGC ACA TTG ACC ACT TCA

TPLEKRXTMMGVXTLTTS
ATT GCT CCA GCC ATT GGT CCA ACG TAT GGT GGA ATC TTA ACA TCT TCC CTG TCT

IAPAIGPTYGGILTSSLS 
TGG CAC GCC ATC TTT TTA TTC TTA ATT CCT ATT TTA CTG CTT TCT TTG TTT ATG 

WHAIFLFLIPILLLSLFM
GGC TTA TCT GCT ATT CCA GAA ATA CCT GTT AAA AAA ACC ACC ACT CTT GAT TTG 

GLSAIPEI PVKKTTT LDL
GTT AGT TTA ATT GGT ATT GCC CTG TTA TTT AGT GGT TTG TTG ATG TTT TTA AGT 

VSLI GIAL LFSGLLMFLS
AAA ATT GGC ACA TTG TTT GGC TGG CTT TCA CTT CTG GCA GCT GTT ATT GGC TTT 

K I G TLFGWLSLLAAVI GF
GTT ATT TTT TAT AAA CGA GCA ACG ACC GCT GAA CAG CCA CTA GTT CGT TTG ACA 

VIFYKRATTAEQP LVRLT 
ATT TTA AAA AAT CCC GCC TTC GTT TTA TTT TTA TGT GGC TTT TTA GTT TGT CAA 
ILKNPAFVLFLCGFLVCQ 

TTC TTA TTG TTA GGT ATT TCC TTT GTC TTA CCA AAT TTT GTT CAA ATC GTC CTT 
FLLLGI SFVLPNFVQIVL
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GGA AAA AAT GCG TTT GTA GCT GGG CTA GTC ATG CTG CCA GGT GCT ACA : .GTT: GGG :| 

G K N A FVAGLVMLPGATVG 
GCC ATT TTG GCA CCA CTT TCT GGA CGA GTG TTG GAT CAA TAG GGC GCT AAA AAA 

AILAPLSGRVLDQYGAKK 
CCT ATA TTA TTT GGC TTA AGT TTG GCG ACA ATC GGT TGG CTA GCT TTA ACA ATT 

PILFGLSLATIGWLALTI
TTA CTA GAG ATG CCT GTT TTA TTA GGA TTC GTC GCT GGA CAT GTT ACC TAT ATG 

LLEMPVLLGFVAGH VTYM
ATT GGT CTA GGT TTT GCA TAG AGT AAT ATG ATG ACG ACT GGT ATG AGC TTG TTA 

IGLGFAYSNMMTTGMSLL 
GAG GAG AAA GAT TTT GGC GAC GGC AAT ACG TTA TTT AAT ACA CTT CAA CAA TTT 

E EKDFGDGNTLFNTLQQF
TCT GGC GCT ATT GCG ACG GCC ATT GTA GCG ACG ATT ATT AAT ATC GCT CAA GAC 

SGAIATAIVATI INIAQD 
CAC GCA GAT AAT TTT GCT GAG GGG ACA ACG ATG GGC TCC CTC ATT TCT CTG ATT 

HADNFAQGTTHGSLISLI
TTC TTG TTA ACC TTA TTA GTA ATT GTT TTA ATT GCT TGC TGG AAT TAT TTC CGC 
FLLTLLVIVLIACWNYFR
AAA AAA GCT TAA 3'

K K A * 
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5 ' ATG AAA AAC ACC CQA AAC ACC GTC CCA TCA TTA CTA ATT ATT ATT TTG CTT GTG 
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L L 
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GYWSAEKI AFAGLLLS IF 
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GOG GCT TGC TGT TTA TTA ATA ATG ACA CAA TTT GAA ACA ATC ATT GGJ ^fJTT. , ATT * 

AACCLLIMTQFETI I GLI
TTG TTT ATT TTT CTC TTT TTC TTA GGG ATT AAT ATT ACT TTG CCC AAT GCC TTA 

LFIFLFFLGINITLPNAL 
AGT ATG GCC TTA AAA GGC TAT GAA TCT GTA ATT GGT ACT GCC AGT GGG ATT TTT

SMALKGYESVIGTASGIF 
AGT TTT GTC TAT TAC TTG TTT GTT AGT TTC TTT ACC TAT TTG ATT AGT TAC TTT 

SFVYYLFVSFFTYLISYF 
CAC AAT GGT ACC ATT TGG GTC TTG CCG CTT TAC TTT TTA AGT TTT GCT TGT CTG 

HNGT I WVL P LYFLS FACL
TTA GCT TTC AGC TAT TAT GTC ATT GTT TAT CGT AAA ACT TTA AAA TAA 3'

LAFSYYVIV YRKTLK* 



Fig. 35B 




5' GTG GGG CTT AAC ATT GTC TTA CCA CTC GTT AAT ATT ATT AAT GGA TTA GGT TGG
VGLNIVLPLVNIINGLGW
ATG TTT GGC GTT GGC GGC GCT ACG TTG TTT TCG ACC ACG GTT GCG CAA AAA GAA 

MFGVGGATLFSTTVAQKE
ATC AAA AAA GCC AAT CAG TAT TTT AGT TTA ACG ATT GGG TTA GTT TTC GTT ATA 

IKKANQYFSLTIGLVFVI
GGG AGT CTC TTT ACT TTA GTG AGC TTG ATT TTT TCC GAT CAA ATT ATT CGG GGG 

GS L FTLV S L I F S DQ I I R G 
CTT CAG GGG ACC GGT GTC TTG TTT GGT TTG GCA AAG GAA TAT TAT ATG ATT TAC 

LQGTGVLFGLAKEYYMIY
CTT AGC TGT TCA TTA TTA TTT ATT TTA AAC AAT TGC CTG ATT ACT TTT TTA AGA 

LSCSLLFILNNCLITFLR
AAT GAC CAT AAT CCT CGT TTG GCA ACG ATT GCT TTT GTC AGT GGG GGA ATT GTC 

NDHNPRLATIAFVSGGIV 
AAC ATT ATT TTG GAT TAT GTC TTT ATT TAT CAA TTT GGG TGG GGT ATG GCA GGT 

NI I LDYVFI YQF GWGMAG
GCG GCC ATT GCA ACG GTG ATG TCA CCG TTA ACA AGC TTA ATC CTG ATT ACG TTG 

AAIATVMS PLT SLILI TL 
CAT AAG TGG TCG CCG CAA GGA GTT TTA CGT TTT GAA AAG TTT AAA GTG AAA TTT 

HKWSPQRVLRFEKFKVKF
CAG GAT GTT CGA GAA ATT ATG TCG ATT GGT TTT TCC TCC TTT TTA AAT GAA TTT 

QD VREIMS IGFSSFLNEF
TCT TCG GCC TTT GTC ATG TTT TTA TTT AAT ATT GTT TTG TTA AAC TTA GTT GGT 

SSAFVMFLFNIVLLNLVG 
CAT GTG GCG ATT TCA GCT TAT GCG ATT GTT GCC AAT CTC AAT ATT ATT GTG ATT 

H V A I SAYA I V ANLN I I V I 
GCT ATT TTT ACA GGG ATT GGG CAA GGG GCG CAA CCA TTA CTC AGC CGA TAC TAT 

AIFTGIGQGAQPLLSRYY 
GGT TTA GGG GAA ACG AAG GTG TTG CGT AAA TTT GTT AAA CTT AGC TTT ATT ACG 

GLGETKVLRKF VKLSFIT
TAC TTG GTC GCC GGC TTC CTG TTC TTT TTA ATC AGT CAA GTG TTT ACG GGA CAA 
YLVAG FLFFLI SQVFTGQ
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ATT ATT GAG GTC TTT ART AGC QAA GGG ARC GAC CAG TTG GCG CAA ATT. OCT AGO 



IIEVPNSEONDQLAQIAR 
ACA GCC ATC CGC TTA TAT GOG ATT TCC TTC TTG TTT ACA GGA TTG AAT TTT ATG 



T A I R 
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I S 
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G I Y Y 
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S A 


V R 


KPKMALM I S 


TCT TTG CGT GGG TTC TTT TTA ATT GTG CCA GTT TTA TTT ATT TTG GTG AAA TTA

S L R G F F L I V P V L F I L V K L
CTA GGA TTA ACC GGT GTT TGG TTA GCC ATG CCA GTA GTT GAA TTT GTT ACG TTT

L G L T G V W L A M P V V E F V T F
GGA CTA ATG CTT GTG GGC TAC CTT GCG TAT CGA AAC TAT TTG AAA AAA AGA GAA

G L M L V G Y L A Y R N Y L K K R E
GCA GTA ACC TGA 3'
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5» ATG CCA TAT TTA TAT TTG TTA TTA GCA ATA GTC AGT GAA GTA ATA GGC'*^&3T^GfeA > - 
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p y 


L Y 


L L 
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